SEQUENCE LISTING 



( 1 ) GENE RAL\ INFORMATION : 

(i) APPLICANT: Inouye, Sumiko 
A Hsu, Mei-Yin 

\ Eagle, Susan 

\ Inouye, Masayori 

(ii) TITLE OF INVENTION: Prokaryotic Reverse Transcriptase 
(iii) NUMBER OF SEQUENCES: 45 

CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Weiser & Associates 

(B) STREET: 230 South Fifteenth Street, Suite 500 

(C) CITY : \philadelphia 

(D) STATE A Pennsylvania 

(E) COUNTRY*: U.S.A. 

(F) ZIP: 19102 

COMPUTER READABLE FORM: 

(A) MEDIUM TOE: Floppy disk 

(B) COMPUTER :\ IBM PC compatible 

(C) OPERATING ^SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Ratentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/269,118 

(B) FILING DATE: Y30-JUN-1994 

(C) CLASSIFICATION; 

ATTORNEY /AGENT INFORMATION: 

(A) NAME: Weiser, Gerard J. 

(B) REGISTRATION NUMBER: 19,763 

(C) REFERENCE / DOCKETV NUMBER : 377.5888P 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-873^8383 

(B) TELEFAX: 215-875-8B94 



(2) INFORMATION FOR SEQ ID NO : 1 : \ 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 2176 base paifis 

(B) TYPE: nucleic acid \ 

(C) STRANDEDNESS : double ) 

(D) TOPOLOGY: linear 





m 



r 3 ^ 



(v) 



(vi) 



E _ = 



(viii) 



(ix) 



FEATURE : 

(A) NAME /KEY: CDS 



% 



(B) LOCATION: 640.. 2094 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TCATCCGCGC GGACACCCCC TCCTACGTGC CCCCCGACGC GGAGAGCGGC GTGGAGACGG 
TGTACCGCGT TTCCCT.GGAT GGTCACCTGG TGGCGGTGGA GTGGGGCCCG CGCACGGGCT 
CGCCGCGTCA CCAGCGdCTC TGGTTCGACT CGGATGCGGA AGCCCCCGGA GCCTACTTCG 
CGCGCCTCGA GAAGTTGG&G GCTGACGGCT ACATCGACGC GGCCTCGGCA TTGGTCTAAA 
CCCTTCAACC ACGGCTCGGQ, CGCCACGCGC GGCCGGCAGG ACAGGTGCGA CGAACAGACG 
ACGACGTGCG CTTCACGCGC \?AGCAGCCGA GAGAGGTCCG GAGTGCATCA GCCTGAGCGC 
CTCGAGCGGC GGAGCGGCGT TfcCGCCGCTC CGGTTGGAAT GCAGGACACT CTCCGCAAGG 
TAGCCTGTTC TTGGCTCTCT CC^TCCTAGG CACTACGGCC AGGGTGGGTA GCGGAGCCAA 
ffiCGACGCCACC GCCGTTTACC CAC(\CCGGCC GTAGTGCCTA GGAGGGGAGA GCCGGTGAGG 

TGCT TTCCCGGCCT CCGTCGACTG CTCGCGCCAT 




\ CTACCGTGCC CCAGGTAAGA TGGTG\ 

{ GTCCCGTCTT CCATCGCCGC GCCCGC^CAA GGTGCAGAC ATG ACC GCC AGG CTG 

Met Thr Ala Arg Leu 

1 5 



GAC CCG TTC GTC CCC GCA GCT TCG\CCG CAG GCC GTG CCC ACG CCC GAG 
*3 Asp Pro Phe Val Pro Ala Ala Ser Kro Gin Ala val Pro Thr Pro Glu 

10 \ 15 20 



UJ CTC ACC GCT CCG TCG TCA GAC GCG GCCx GCG AAG CGT GAA GCC CGC CGG 
=fl Leu Thr Ala Pro Ser Ser Asp Ala AlaWa Lys Arg Glu Ala Arg Arg 
SJ 25 30 \ 35 

CTC GCG CAC GAA GCG TTG CTC GTC CGC GCG AAG GCC ATC GAC GAA GCG 
Leu Ala His Glu Ala Leu Leu Val Arg Alk Lys Ala lie Asp Glu Ala 
40 45 \ 50 



GGC GGC GCC GAC GAC TGG GTG CAG GCG CAG 
Gly Gly Ala Asp Asp Trp val Gin Ala Gin 
55 60 



\TC GTC TCC AAG GGG CTC 
Val Ser Lys Gly Leu 
65 



GCG GTC GAG GAC CTG GAC TTC TCC AGC GCC TCC\GAG AAG GAC AAG AAG 
Ala Val Glu Asp Leu Asp Phe Ser Ser Ala Ser slu Lys Asp Lys Lys 
70 75 80 \ 85 

GCC TGG AAG GAG AAG AAG AAG GCC GAG GCC ACC GAS CGC CGC GCG CTG 
Ala Trp Lys Glu Lys Lys Lys Ala Glu Ala Thr Glu\ Arg Arg Ala Leu 

90 95 \ 100 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
654 

702 

750 

798 

846 

894 

94 2 



AAG CGT CAG GCG CAC GAG GCG TGG AAG GCC ACG CAC 



LTG GGC CAC CTG 



990 



% 



s 



Lys 



GGC 
Gly 



GTG 
Val 



GAC 
Asp 
150 

CTC 
Leu 



Ala His 
105 

GCG Gtec GTG CAC 
Ala Gl\ Val His 
12 0 



CCC CAC 
Pro His 
135 

TCC GCG 
Ser Ala 



CGC TGG 
Arg Trp 



CGC GAG 
ArgVGlu 



Glu Ala Trp 



TGG GCG GAG 
Trp Ala Glu 
125 

GAG CGC GCC 
Glu Arg Ala 
140 



Lys Ala Thr His 
110 

GAC CGC CTG GCC 
Asp Arg Leu Ala 



GAG 

Glu Ala >Leu 
r 



CTG GCC AAG 
Ala Lys 

5 



GTG AGC TGG 



8 



p 


Val 


Ser 


Trn 


m 


TCC 


CCC 


AAG 


o 


Ser 


Pro 


Lys 


S3 






200 










UJ 


GTC 


GTG 


GAG 




Val 


Val 


Glu 






215 






'GGA 


CGC 


TCC 




Gly 


Arg 


Ser 


y 


230 








GTG 


GTC 


AAG 




Val 


Val 


Lys 




CGG 


GTG 


AAG 




Arg 


Val 


Lys 




ACG 


CTG 


CTG 




Thr 


Leu 


Leu 








280 




TTC 


CGC 


GGC 




Phe 


Arg 


Gly 






295 






CAG 


GGC 


GCC 




Gin 


Gly 


Ala 




310 







TTC 
Phe 



ACC 
Thr 
185 

CCT 
Pro 



GCG 
Ala 
170 

ATT 
He 



GAG 
Glu 



TTCXCAC CGG 
Phe >Jis Arg 



CCG AAG\CGG 
Pro Lys \rg 



CGG 
Arg 



GCG 
Ala 



GAG 
Glu 



GAC 
Asp 
190 



CTG AAG 
Leu Lys Ala 
205 



GCA\GCG 
a 



GCC AAC GGC 
Ala Asn Gly 
145 

CTG GGG CTG 
Leu Gly Leu 
160 

GTC GAC ACG 
Val Asp Thr 
175 

GGC AGC AAG 
Gly Ser Lys 



CAG CGC TGG 
Gin Arg Trp 



Val Gly 
115 

GAC GCG 
Asp Ala 
130 

CTG ACG 
Leu Thr 



AGC GTC 
Ser Val 



GCC ACG 
Ala Thr 



CGC ACG 
Arg Thr 
195 

GTG CTG 
Val Leu 
210 



His Leu 



TTC GAC 
Phe Asp 



GAG CTG 
Glu Leu 



TCC AAG 
Ser Lys 
165 

CAC TAC 
His Tyr 
180 

ATT ACG 
He Thr 



TCC AAC 
Ser Asn 




CGG CTG 
Arg Leu 



ATC CTC 
He Leu 



GTG GAC 
Val Asp 
250 

GGC CTG 
Gly Leu 
265 

TCC CTC 
Ser Leu 



AAG CTC 
Lys Leu 



CCC ACG 
Pro Thr 



CCG GTC CAC 
Pro Val His 
220 

ACC AAC GCG 
Thr Asn Ala 
235 

CTC AAG GAC 
Leu Lys Asp 



TTG CGC AAG 
Leu Arg Lys 



CTC TCC ACG 
Leu Ser Thr 
285 

CTG CAC GTC 
Leu His Val 
300 

TCG CCC GGC 
Ser Pro Gly 
315 



GGC 

Gly A\a 



CTG GCC 
Leu Ala 



TTC TTC 
Phe Phe 
255 

GGC GGC 
Gly Gly 
270 

GAA GCG 
Glu Ala 



GCC CAC 
Ala His 
225 

C CAG 
H3\s Gin 
24 1 



CTG CGG 
Leu Arg 

CCG CGG 
Pro Arg 



GGC TTC 
Gly Phe 



GGC GCG 
Gly Ala 



GTC ACC 
Val Thr 



5AG GGC 
l^u Gly 
275 



GTG GCG 
Val Ala 



GAC GTC 
Asp Val 
245 

TGG CGC 
Trp Arg 
260 

ACG TCC 
Thr Ser 



GCC AAG GGC CCG 
Ala Lys Gly Pro 
305 

ATC ACC AAC GCG 
He Thr Asn Ala 
320 



CGC GCC 
Arg Ala 



CTC TGC 
Leu Cys 



GTC CAG 
Val Gin 



VCTG CCC 
leu Pro 



1038 



1086 



1134 



1182 



1230 



1278 



1326 



1374 



1422 



1470 



1518 



1566 



1614 



Pi 




CTC GAC AAG CGG CTG TCC GCC CTC GCG AAG CGG 
Leu Asp Ly^ Arg Leu Ser Ala Leu Ala Lys Arg 

330 335 

ACG CGC TAC 3CG GAC GAC CTG ACC TTC TCC TGG 
Thr Arg Tyr Aria Asp Asp Leu Thr Phe Ser Trp 
345\ 350 

CCC AAG CCG CGG CGG ACG CAG CGT CCC CCC GTC 
Pro Lys Pro Arg Arg Thr Gin Arg Pro Pro Val 
360 ~ \ 365 

CGC GTC CAG GAA GTG Wg GAG GCG GAG GGC TTC 
Arg Val Gin Glu Val Val Glu Ala Glu Gly Phe 
375 \ 380 

AAG ACG CGC GTC GCC CGCYaAG GGC ACG CGG CAG 
Lys Thr Arg Val Ala Arg Yys Gly Thr Arg Gin 
390 395 \ 400 

GTC GTG AAT GCG GCG GGC AA<S GAC GCG CCC GCG 
Val Val Asn Ala Ala Gly Lys\ Asp Ala Pro Ala 

410 \ 415 

GAC GTC GTC CGC CAG CTC CGC G&C GCC ATC CAC 
Asp Val Val Arg Gin Leu Arg Al\ Ala lie His 
425 \430 

AAG CCG GGC CGC GAG GGC GAG TCG CTC GAG CAG 
Lys Pro Gly Arg Glu Gly Glu Ser L*^i Glu Gin 
440 445 



GCC TTC ATC CAC ATG ACG GAC CCG GCC AAG GGC 
Ala Phe lie His Met Thr Asp Pro Ala Gly 
455 460 

CAG CTC ACG GAG CTC GAG TCC ACG GCG AGC C^CC 
Gin Leu Thr Glu Leu Glu Ser Thr Ala Ser A\a 
470 475 48 



CTG GGC TTC ACC TAC 
Leu Gly Phe Thr Tyr 
340 

ACG AAG GCG AAG CAG 
Thr Lys Ala Lys Gin 
355 

GCG GTC CTC CTG TCT 
Ala Val Leu Leu Ser 
370 

CGC GTG CAC CCG GAC 
Arg Val His Pro Asp 
385 

CGG GTC ACC GGG CTC 
Arg Val Thr Gly Leu 

405 

GCC CGA GTC CCG CGC 
Ala Arg Val Pro Arg 
420 

AAC CGG AAG AAG GGC 
Asn Arg Lys Lys Gly 
435 

CTC AAG GGC ATG GCC 
Leu Lys Gly Met Ala 
450 

CGC GCC TTC CTG GCT 
Arg Ala Phe Leu Ala 
465 

GCT CCG CAG GCG GAG 
Ala Pro Gin Ala Glu 

485 



TGACGCTCAG CGCGCGTCCG TCGCCGACGT GCCGCGCGCC ^GCAACGCCG CATTCAGCAA 
CTCCGTCAGC CGGCGCGGGT AC 



1662 

1710 

1758 

1806 

1854 

1902 

1950 

1998 

2046 

2094 

2154 
2176 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



MOLECULE TYPE: 



protein 



(xi)\SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Val L^s Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro 
1 \ 5 10 15 

Leu Thr GV Glu Lys lie Lys Ala Leu Val Glu He Cys Thr Glu Met 
20 25 30 

Glu Lys Glu Gly Lys He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn 
35 \ 40 45 



Thr Pro Val Phe 
50 



SJa He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys 
55 60 



Leu Val Asp Phe Argv Glu Leu Asn Lys Arg Thr Gin Asp Phe Trp Glu 
65 \70 75 80 

Val Gin Leu Gly He P^p His Pro Ala Gly Leu Lys Lys Lys Lys Ser 

85 \ 90 95 

Val Thr Val Leu Asp Val &ly Asp Ala Tyr Phe Ser Val Pro Leu Asp 
100 \ 105 110 

Glu Asp Phe Arg Lys Tyr Thr Via Phe Thr He Pro Ser He Asn Asn 
H5 rSO 125 



Glu Thr Pro Gly He Arg Tyr Gln\Tyr 
130 135 



Asn Val Leu Pro Gin Gly Trp 
140 



Lys Gly Ser Pro Ala lie Phe Gin SeV Ser Met Thr Lys lie Leu Glu 
145 150 \ 155 160 

Pro Phe Lys Lys Gin Asn Pro Asp lie Vfel lie Tyr Gin Tyr Met Asp 

165 1T0 175 



Asp Leu Tyr Val Gly Ser Asp Leu Glu He 
180 185 



Gin His Arg Thr Lys 

190 



He Glu Glu Leu Arg Gin His Leu Leu Arg Tr^Gly Leu Thr Thr Pro 
195 200 \ 205 

Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Tr*fc Met Gly Tyr Glu 
210 215 220\ 

Leu His Pro Asp Lys Trp Thr Val Gin Pro lie Val lVi Pro Glu Lys 
225 230 235 \ 240 

Asp Ser Trp Thr Val Asn Asp lie Gin Lys Leu Val Gly %ys Leu Asn 

245 250 \ 255 



Trp Ala Ser Gin lie Tyr Pro 
260 




) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TXYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Arg Pro Trp Ala Arg. Thr Pro Pro Lys Ala Pro Arg Asn Gin Pro Val 
1 5 \ 10 15 

Pro Phe Lys Pro Glu AVg Leu Gin Ala Leu Gin His Leu Val Arg Lys 
20 \ 25 30 

Ala Leu Glu Ala Gly His\lle Glu Pro Tyr Thr Gly Pro Gly Asn Asn 
35 \ 40 45 

Pro Val Phe Pro Val Lys hyk Ala Asn Gly Thr Trp Arg Phe lie His 



iO 



55 



60 



Asp Leu Arg Ala Thr Asn Ser Lteu Thr lie Asp Leu Ser Ser Ser Ser 
65 70 \ 75 80 

Pro Gly Pro Pro Asp Leu Ser SerYeu Pro Thr Thr Leu Ala His Leu 

85 \ 90 95 

Gin Thr lie Asp Leu Arg Asp Ala Phe^ Phe Gin lie Pro Leu Pro Lys 
100 105\ 110 

Gin Phe Gin Pro Tyr Phe Ala Phe Thr Val Pro Gin Gin Cys Asn Tyr 
115 120 \ 125 

Gly Pro Gly Thr Arg Tyr Ala Trp Lys Val\Leu Pro Gin Gly Phe Lys 
130 135 \ 140 

Asn Ser Pro Thr Leu Phe Glu Met Gin Leu Aifta His He Leu Gin Pro 
145 150 155\ 160 

He Arg Gin Ala Phe Pro Gin Cys Thr He Leu G\ln Tyr Met Asp Asp 

165 170 \ 175 

He Leu Leu Ala Ser Pro Ser His Glu Asp Leu Leu\Leu Leu Ser Glu 
180 185 \ 190 

Ala Thr Met Ala Ser Leu He Ser His Gly Leu Pro Va\ Ser Glu Asn 
195 200 205> 

Lys Thr Gin Gin Thr Pro Gly Thr lie Lys Phe Leu Gly G\n He He 
210 215 220 



% H 



Ser Vro Asn His Leu Thr Tyr Asp Ala Val Pro Thr Val Pro lie Arg 
225 \ 230 235 240 

Ser Arg\Trp Ala Leu Pro Glu Leu Gin Ala Leu Leu Gly Glu lie Gin 

245 250 255 

Trp Val Se\ Lys Gly Thr Pro 

\260 

INFORMATION FoV SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH \ 259 amino acids 

(B) type: amino acid 
(D) TOPOLOGY \ linear 

(ii) MOLECULE TYPE: pVotein 



(xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 4: 

Asn Val Leu Tyr Arg lie Glsy Ser Asp Asn Gin Tyr Thr Gin Phe Thr 
1 5 \ 10 15 

He Pro Lys Lys Gly Lys Gly Wl Arg Thr He Ser Ala Pro Thr Asp 
20 \ 25 30 

Arg Leu Lys Asp He Gin Arg ArgNTle Cys Asp Leu Leu Ser Asp Cys 
35 40 \ 45 

Arg Asp Glu He Phe Ala He Arg Lys\ lie Ser Asn Asn Tyr Ser Phe 
50 55 \ 60 

Gly Phe Glu Arg Gly Lys Ser lie He Le\u Asn Ala Tyr Lys His Arg 
65 70 \ 75 80 

Gly Lys Gin lie lie Leu Asn lie Asp Leu Bys Asp Phe Phe Glu Ser 

85 90 \ 95 

Phe Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu\Ser Asn Gin Asp Phe 
100 105 \ 110 

Leu Leu Asn Pro Val Val Ala Thr Thr Leu Ala Ly\ Ala Ala Cys Tyr 
H5 120 \125 

Asn Gly Thr Leu Pro Gin Gly Ser Pro Cys Ser Pro I^e He Ser Asn 
130 135 140 

Leu lie Cys Asn lie Met Asp Met Arg Leu Ala Lys Leu\Ala Lys Lys 
145 150 155 \ 160 

Tyr Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp He Thr l\e Ser Thr 

165 170 \ 175 




Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gin Pro Glu Gly 
180 185 190 

Val V^l Leu Gly Lys Val Leu Val Lys Glu lie Glu Asn Ser Gly Phe 
195 200 205 

Glu lie Aen Asp Ser Lys Thr Arg Leu Thr Tyr Lys Thr Ser Arg Gin 
210 \ 215 220 

Glu Val Thr (Sly Leu Thr Val Asn Arg lie Val Asn lie Asp Arg Cys 
225 \ 230 235 240 

Tyr Tyr Lys Lys\Thr Arg Ala Leu Ala His Ala Leu Tyr Arg Thr Gly 

IA5 250 255 

Glu Tyr Lys 



INFORMATION FOR SEQ ID\NO:5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 266 amirf*? acids 

(B) TYPE: amino aci< 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID 3SfO:5 

Ala Phe His Arg Glu Val Asp Thr AlaVrhr His Tyr 
1 5 



Val Ser Trp Thr 
15 



lie Pro Lys Arg Asp Gly Ser Lys Arg Thi\ lie Thr 
20 25 

Glu Leu Lys Ala Ala Gin Arg Trp Val Leu Se^ Asn 
35 40 



Ser Pro Lys Pro 
30 

Val Val Glu Arg 
45 



Leu Pro Val His Gly Ala Ala His Gly Phe Val A\a Gly Arg Ser He 
50 55 60 



Leu Thr Asn Ala Leu Ala His Gin Gly Ala Asp Val 
65 70 75 



xl Val Lys Val 
80 



Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Trp Arg 

85 90 



Arg \al Lys Gly 
95 



Leu Leu Arg Lys Gly Gly Leu Arg Glu Gly Thr Ser 
100 105 



Thr Lei\Leu Ser 
110 



Leu Leu Ser Thr Glu Ala Pro Arg Glu Ala Val Gin 
115 120 



Phe Arg Gl\Lys 
125 




Leu Le\i His Val Ala Lys Gly Pro Arg 
13(\ 135 



Ala Leu Pro Gin 
140 



Gly Ala Pro 



Thr Ser pVo Gly lie Thr Asn Ala Leu 
145 \ 150 



Cys Leu Lys Leu 
155 



Asp Lys Arg 
160 



Leu Ser Ala i^u Ala Lys Arg Leu Gly 

165 



Phe Thr Tyr Thr 
170 



Arg Tyr Ala 
175 



Asp Asp Leu Thr \Phe Ser Trp Thr Lys Ala Lys Gin Pro 
180 \ 185 



Lys Pro Arg 
190 



Arg Thr Gin Arg Pro Pro Val Ala Val 
195 \ 200 



Leu Leu Ser Arg 
205 



Val Gin Glu 



Val Val Glu Ala Glu (\ly 
210 



Phe Arg Val 
215 



His Pro Asp Lys 
220 



Thr Arg Val 



Ala Arg Lys Gly Thr Arg^ Gin Arg Val 
225 23( 



Thr Gly Leu Val 
235 



Val Asn Ala 
240 



Ala Gly Lys Asp Ala Pro A\a Ala Arg 

245 



Val Pro Arg Asp 
250 



Val Val Arg 
255 



Gin Leu Arg Ala Ala lie His Vsn Arg Lys 
260 \ 265 

INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Pro Thr Pro Glu Leu Thr Ala Pro Ser Ser As^Ala Ala Ala Lys Arg 
l 5 10 \ 15 

Glu Ala Arg Arg Leu Ala His Glu Ala Leu Leu vM. Arg Ala Lys Ala 
20 25 \ 30 

lie Asp Glu Ala Gly Gly Ala Asp Asp Trp Val Gin N^la Gin Leu Val 
35 40 

Ser Lys Gly Leu Ala Val Glu Asp Leu Asp Phe Ser Se^ Ala Ser Glu 
50 55 60 



Lys Asp Lys Lys Ala Trp Lys Glu Lys Lys Lys Ala Glu ALa Thr Glu 
65 70 75 80 




Arg\Arg Ala Leu Lys Arg Gin Ala His Glu Ala Trp Lys Ala Thr His 

85 90 95 

Val Gl^His Leu Gly Ala Gly Val His Trp Ala Glu Asp Arg Leu 
100 105 110 

INFORMATIOfctvFOR SEQ ID NO : 7 : 

(i) SEQUENCRVCHARACTERISTICS : 

(A) LENGTH : 110 amino acids 

(B) TYPE \ amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEk: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Pro Asp Pro Asp Met Tht Arg Val Thr Asn Ser Pro Ser Leu Gin Ala 
l 5 \ io 15 

His Leu Gin Ala Leu Tyr ]W Val Gin His Glu Val Trp Arg Pro Leu 
20 \ 25 30 

Ala Ala Ala Tyr Gin Glu GlnVeu Asp Arg Pro Val Val Pro His Pro 
35 4v0 45 

Tyr Arg Val Gly Asp Thr Val tA Val Arg Arg His Gin Thr Lys Asn 
50 55 \ 60 

Leu Glu Pro Arg Trp Lys Gly Pro tW Thr Val Leu Leu Thr Thr Pro 
65 70 \ 75 80 

Thr Ala Leu Lys Val Asp Gly He Ala \la Trp He His Ala Ala His 

85 951 95 

Val Lys Ala Ala Asp Pro Gly Gly Gly PrcXser Ser Arg Leu 
100 105 \ 110 

INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : \ 
Gly Lys Asp Ala Pro Ala Ala Arg Val Pro Arg Asp Val Val Arg Gin 



% H 



5 10 15 

Leu Xrg Ala Ala lie His Asn Arg Lys Lys Gly Lys Pro Gly Arg Glu 
20 25 30 

Gly Glu\ Ser Leu Glu Gin Leu Lys Gly Met Ala Ala Phe He His Met 
x 15 40 45 

Thr Asp Pico Ala Lys Gly Arg Ala Phe Leu Ala Gin Leu Thr Glu Leu 
50 \ 55 60 

Glu Ser ThrNAla Ser Ala Ala Pro Gin Ala Glu 
65 \ 70 75 

) INFORMATION F0R\ SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH A 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGy\ linear 

(ii) MOLECULE TYPE: Wotein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly Lys Glu Gly His Ser Ala Arg Gin Cys Arg Ala Pro Arg Arg Gin 
1 5 \ 10 15 

Gly Cys Trp Lys Cys Gly Lys\Pro Gly His He Met Thr Asn Cys Pro 
20 \ 25 30 

Asp Arg Gin Ala Gly Phe Leu gW Leu Gly Pro Trp Gly Lys Lys Pro 
35 40\ 45 

Arg Asn Phe Pro Val Ala Gin Val \ro Gin Gly Leu Thr Pro Thr Ala 
50 55 \ 60 

Pro Pro 
65 

INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10 



% 




Gly pVo Arg Ala Leu Pro Gin Gly Ala Pro Thr Ser Pro Gly lie Thr 
1 \ 5 10 15 

Asn Ala X^eu Cys Leu Lys Leu Asp Lys Arg Leu Ser Ala Leu Ala Lys 
20 25 30 

Arg Leu Gly\Phe Thr Tyr Thr Arg Tyr Ala Asp Asp Leu Thr Phe Ser 
35 \ 40 45 

Trp Thr Lys Al^a Lys Gin Pro Lys Pro Arg Arg Thr Gin Arg Pro Pro 
50 \ 55 60 

Val Ala val Leu 
65 

(2) INFORMATION FOR SEQ Vd NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 aWno acids 

(B) TYPE: amino afcid 
(D) TOPOLOGY: linear 

\ (ii) MOLECULE TYPE: prote\n 



o 



H 3 

e 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Tyr Asn Gly Thr Leu Pro Gin Oly Ser Pro Cys Ser Pro lie lie Ser 
1 5 \ 10 15 

Asn Leu lie Cys Asn lie Met Asp\Met Arg Leu Ala Lys Leu Ala Lys 
20 \25 30 

Lys Tyr Gly Cys Thr Tyr Ser Arg TVr Ala Asp Asp lie Thr lie Ser 
35 40 \ 45 

Thr Asn Lys Asn Thr Phe Pro Leu Glu\Met Ala Thr Val Gin Pro Glu 
50 55 \ 60 

Gly Val Val Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



% % 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Tyr lAs Asn Leu Leu Pro Gin Gly Ala Pro Ser Ser Pro Lys Leu Ala 
1 \ 5 10 15 

Asn Leu\le Cys Ser Lys Leu Asp Tyr Arg lie Gin Gly Tyr Ala Gly 
20 25 30 

Ser Arg Gl$c Leu lie Tyr Thr Arg Tyr Ala Asp Asp Leu Thr Leu Ser 
35 \ 40 45 

Ala Gin Ser l^et Lys Lys Val Val Lys Ala Arg Asp Phe Leu Phe Ser 
50 \ 55 60 

lie lie Pro Ser\ 
65 

(2) INFORMATION FOR SE\) ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 7\ amino acids 

(B) TYPE: aminAacid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



Q 

Id 

S 




(xi) SEQUENCE DESCRIPTION A SEQ ID NO: 13: 

Tyr Gin Tyr Asn Val Leu Pr A Gin Gly Trp Lys Gly Ser Pro Ala lie 
1 5 \ 10 15 

Phe Gin Ser Ser Met Thr Lys iVe Leu Glu Pro Phe Lys Lys Gin Asn 
20 \ 25 30 

Pro Asp lie Val lie Tyr Gin Tyr\Met Asp Asp Leu Tyr Val Gly Ser 

45 

Lys lie Glu Glu Leu Arg Gin 
60 



35 



40 



Asp Leu Glu lie Gly Gin His Arg 
50 55 

His Leu Leu 
65 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Tyr Ala Trp Lys Val Leu Pro Gin Gly Phe Lys Asn Ser Pro Thr Leu 
15 10 15 

Phe Glu Met Gin Leu Ala His lie Leu Gin Pro lie Arg Gin Ala Phe 
20 25 30 

Pro Gin Cys Thr lie Leu Gin Tyr Met Asp Asp lie Leu Leu Ala Ser 
35 40 45 

Pro Ser His Glu Asp Leu Leu Leu Leu Ser Glu Ala Thr Met Ala Ser 
50 55 60 

Leu lie 
65 

(2) INFORMATION FOR SEQ ID NO: 15: 

EQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



^ 1 


(i) 


i 

S 

5 - I 


i (ii) 






0 


(xi) 




Leu 




l 


w 






Phe 







(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15; 



10 15 



20 25 30 

Pro Asp Leu lie Leu Leu Gin Tyr Val Asp Asp Leu Leu Leu Ala Ala 
35 40 45 

Thr Ser Glu Leu Asp Cys Gin Gin Gly Thr Arg Ala Leu Leu Gin Thr 
50 55 60 

Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Phe Gin Trp Lys Val Leu Pro Gin Gly Met Thr Cys Ser Pro Thr lie 
15 10 15 

Cys Gin Leu Val Val Gly Gin Val Leu Glu Pro Leu Arg Leu Lys His 
20 25 30 

Pro Ser Leu Cys Met Leu His Tyr Met Asp Asp Leu Leu Leu Ala Ala 
35 40 45 

Ser Ser His Asp Gly Leu Glu Ala Ala Gly Glu Glu Val lie Ser Thr 
50 55 60 

Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 17: 

EQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Trp Arg Val Leu Pro Gin Gly Phe lie Asn Ser Pro Ala Leu 
5 10 15 

Glu Arg Ala Leu Gin Glu Pro Leu Arg Gin Val Ser Ala Ala Phe 
20 25 30 

Gin Ser Leu Leu Val Ser Tyr Met Asp Asp lie Leu Tyr Ala Ser 
35 40 45 

Thr Glu Glu Gin Arg Ser Gin Cys Tyr Gin Ala Leu Ala Ala Arg 
50 55 60 

Leu 
65 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

lie Ala Thr Asn Gly Val Pro Gin Gly Ala Ser Thr Ser Cys Gly Leu 
15 10 15 

Ala Thr Tyr Asn Val Leu Glu Leu Phe Leu Arg Tyr Asp Glu Leu lie 
20 25 30 

Met Tyr Ala Asp Asp Gly lie Leu Cys Arg Gin Asp Pro Ser Thr Pro 
35 40 45 

Asp Phe Ser Val- Glu Glu Ala Gly Val Val Gin Glu Pro 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
i (A) LENGTH: 61 amino acids 

^ (B) TYPE: amino acid 

i (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

§ s : 

1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

UJ Tyr Glu Tyr Leu Arg Met Pro Phe Gly Leu Lys Asn Ala Pro Ala Thr 

U 1 5 10 15 

5 . 2 

yj Phe Gin Arg Cys Met Asn Asp lie Leu Arg Pro Leu Leu Asn Lys His 

SJ 20 25 30 

Cys Leu Val Tyr Leu Asp Asp lie lie Val Phe Ser Thr Ser Leu Asp 
35 40 45 

Glu His Leu Gin Ser Leu Gly Leu Val Phe Glu Lys Leu 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



DO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Tyr Glu Phe Cys Arg Leu Pro Phe Gly Leu Arg Asn Ala Ser Ser lie 
15 10 15 

Phe Gin Arg Ala Leu Asp Asp Val Leu Arg Glu Gin lie Gly Lys lie 
20 25 30 

Cys Tyr Val Tyr Val Asp Asp Val He He Phe Ser Glu Asn Glu Ser 
35 40 45 

Asp His Val Arg His He Asp Thr Val Leu Lys Cys Leu 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



^ Cys Lys Leu Asn Lys Ala He Tyr Gly Leu Lys Gin Ala Ala Arg Cys 

i 5 10 15 

O Trp Phe Arg Cys lie Tyr lie Leu Asp Lys Gly Asn lie Asn Glu Asn 

t 5 20 25 30 

s""i 

yJ He Tyr Val Leu Leu Tyr Val Asp Asp Val Val He Ala Thr Gly Asp 

«3 3 5 4 0 4 5 



Met Thr Arg Met Asn Asn Phe Lys Arg Tyr Leu Met Glu Lys Phe 
50 55 SO 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Cys Leu Leu Lys Lys Ser Leu Tyr Gly Leu Lys Gin Ser Pro Arg Gin 
15 10 15 



% 



Trp Asn Ala Cys Val Tyr Val Lys Gin Val Ser Glu Gin Glu His Leu 
20 25 30 

Tyr Leu Leu Leu Tyr val Asp Asp Met Leu lie Ala Gly Lys Ser Lys 
35 40 45 

Ser Glu lie Asn Lys Val Lys Glu Gin Leu Ser Met Glu Phe 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



f*5 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23: 

He Arg Leu Lys Lys Ser Leu Tyr Glu Leu Lys Gin Ser Gly Ala Asn 
15 10 15 



S Trp Tyr Glu Glu Val Arg Gly Trp Ser Cys Val Phe Lys Asn Ser Gin 

20 25 30 



lis 



val Thr He Cys Leu Phe Val Asp Asp Met Val Leu Phe Ser Lys Asn 
35 40 45 



^ Leu Asn Ser Asn Lys Arg He He Glu Lys Leu Lys Met Gin Tyr 

50 55 60 



hi 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note= H The 2' position of this 
nucleotide is linked to the 5" position of 
nucleotide number l of SEQ ID NO: 25 of this 
application . 11 

(ix) FEATURE: 

(A) NAME /KEY : misc_binding 

(B) LOCATION: 52 . . 58 



(D) OTHER INFORMATION: /note= "This region can hydrogen 

bond to nucleotides 61-67 of SEQ ID NO: 25 of this 
application . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CACGCAUGUA GGCAGAUUUG UUGGUUGUGA AUCGCAACCA GUGGCCUUAA UGGCAGGA 58 
(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
Q (B) LOCATION: 1 

gg (D) OTHER INFORMATION: /note= "The 5" position of this 

03 i nucleotide is linked to the 2 1 position of 

nucleotide number 15 of SEQ ID NO: 24 of this 
application. " 



(ix) FEATURE: 

ff (A) NAME /KEY : misc_binding 

^ (B) LOCATION: 61.. 67 

(D) OTHER INFORMATION: /note= "This region can hydrogen 

bond to nucleotides 52-58 of SEQ ID NO: 24 of this 
application . " 



3 



w 



w 

&l (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



TCCTTCGCAC AGCACACCTG CCGTATAGCT CTGAATCAAG GATTTTAGGG AGGCGATTCC 6 0 

TCCTGCC 6 7 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 418.. 2175 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

TGGCCATTNA GATACGGATT TTCACTTCCT TGACAGTGCA TGACTATGCT GCATGAAATN 6 0 

GCATGATCGA TTGAGGATCG TCTTTGCTCA GATCCGCCAG AACTGGCGGG CTTTTGCTCA 12 0 

TGTCATGCAT GTGCATGAAA ACCACTGCAT AAAGCGGGCA GGCGTGGCGG GGATACGAGC 180 

GCGCGCTATC ACCGAAAATA GCCAAAATAC TTCTGGAAAA CAGAAAGTTG AAGTGATATG 24 0 

TTCATAAACA CGCATGTAGG CAGATTTGTT GGTTGTGAAT CGCAACCAGT GGCCTTAATG 3 00 

GCAGGAGGAA TCGCCTCCCT AAAATCCTTG ATTCAGAGCT ATACGGCAGG TGTGCTGTGC 3 60 

GAAGGAGTGC CTGCATGCGT TTCTCCTTGG CCTTTTTTCC T CTGGGATG A AGAAGAA 417 

ATG ACA AAA ACA TCT AAA CTT GAC GCA CTT AGG GCT GCT ACT TCA CGT 4 65 

Met Thr Lys Thr Ser Lys Leu Asp Ala Leu Arg Ala Ala Thr Ser Arg 
15 10 15 

^ i|p GAA GAC TTG GCT AAA ATT TTA GAT ATT AAG TTG GTA TTT TTA ACT AAC 513 

( /fg Glu Asp Leu Ala Lys lie Leu Asp lie Lys Leu Val Phe Leu Thr Asn 
W m 20 25 30 

Q 

^ GTT CTA TAT AGA ATC GGC TCG GAT AAT CAA TAC ACT CAA TTT ACA ATA 561 
Val Leu Tyr Arg lie Gly Ser Asp Asn Gin Tyr Thr Gin Phe Thr lie 
35 40 45 

CCG AAG AAA GGA AAA GGG GTA AGG ACT ATT TCT GCA CCT ACA GAC CGG 6 09 

s Pro Lys Lys Gly Lys Gly Val Arg Thr lie Ser Ala Pro Thr Asp Arg 

O 50 55 60 

bj 

□ TTG AAG GAC ATC CAA CGA AGA ATA TGT GAC TTA CTT TCT GAT TGT AGA 657 

yj Leu Lys Asp lie Gin Arg Arg lie Cys Asp Leu Leu Ser Asp Cys Arg 
yj 65 70 75 80 

M 

GAT GAG ATC TTT GCT ATA AGG AAA ATT AGT AAC AAC TAT TCC TTT GGT 7 05 

Asp Glu lie Phe Ala lie Arg Lys lie Ser Asn Asn Tyr Ser Phe Gly 

85 90 95 

TTT GAG AGG GGA AAA TCA ATA ATC CTA AAT GCT TAT AAG CAT AGA GGC 753 
Phe Glu Arg Gly Lys Ser lie lie Leu Asn Ala Tyr Lys His Arg Gly 
100 105 110 

AAA CAA ATA ATA TTA AAT ATA GAT CTT AAG GAT TTT TTT GAA AGC TTT 801 
Lys Gin lie lie Leu Asn lie Asp Leu Lys Asp Phe Phe Glu Ser Phe 
115 120 125 

AAT TTT GGA CGA GTT AGA GGA TAT TTT CTT TCC AAT CAG GAT TTT TTA 84 9 

Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu Ser Asn Gin Asp Phe Leu 
130 135 140 

TTA AAT CCT GTG GTG GCA ACG ACA CTT GCA AAA GCT GCA TGC TAT AAT 897 
Leu Asn Pro Val Val Ala Thr Thr Leu Ala Lys Ala Ala Cys Tyr Asn 



145 










150 










155 










160 




GGA 
Gly 


ACC 
Thr 


CTC 
Leu 


CCC 
Pro 


CAA 
Gin 
165 


GGA 
Gly 


AGT 
Ser 


CCA 
Pro 


TGT 
Cys 


TCT 
Ser 
170 


CCT 
Pro 


ATT 
He 


ATC 
He 


TCA 
Ser 


AAT 
Asn 
175 


CTA 
Leu 


945 


ATT 

He 


TGC 
Cys 


AAT 
Asn 


ATT 
He 
180 


ATG 
Met 


GAT 
Asp 


ATG 
Met 


AGA 
Arg 


TTA 
Leu 
185 


GCT 
Ala 


AAG 
Lys 


CTG 
Leu 


GCT 
Ala 


AAA 
Lys 
190 


AAA 
Lys 


TAT 
Tyr 


993 


GGA 
Gly 


TGT 
Cys 


ACT 
Thr 
195 


TAT 
Tyr 


AGC 
Ser 


AGA 
Arg 


TAT 
Tyr 


GCT 
Ala 
200 


GAT 
Asp 


GAT 
Asp 


ATA 
He 


ACA 
Thr 


ATT 
lie 
205 


TCT 
Ser 


ACA 
Thr 


AAT 
Asn 


1041 


AAA 
Lys 


AAT 
Asn 
210 


ACA 
Thr 


TTT 
Phe 


CCG 
Pro 


TTA 

Leu 


GAA 
Glu 
215 


ATG 
Met 


GCT 
Ala 


ACT 
Thr 


GTG 
Val 


CAA 
Gin 
220 


CCT 
Pro 


GAA 
Glu 


GGG 
Gly 


GTT 
Val 


1089 


GTT 
Val 
p225 


TTG 
Leu 


GGA 
Gly 


AAA 
Lys 


GTT 
Val 


TTG 
Leu 
230 


GTA 
Val 


AAA 
Lys 


GAA 
Glu 


ATA 
He 


GAA 
Glu 
235 


AAC 
Asn 


TCT 
Ser 


GGA 
Gly 


TTC 
Phe 


GAA 
Glu 
240 


1137 


§ATA 
pile 


AAT 
Asn 


GAT 
Asp 


TCA 
Ser 


AAG 
Lys 
245 


ACT 
Thr 


AGG 
Arg 


CTT 
Leu 


ACG 
Thr 


TAT 
Tyr 
250 


AAG 
Lys 


ACA 
Thr 


TCA 
Ser 


AGG 
Arg 


CAA 
Gin 
255 


GAA 
Glu 


1185 


^ GTA 
^ Val 


ACG 
Thr 


GGA 
Gly 


CTT 
Leu 
260 


ACA 
Thr 


GTT 
Val 


AAC 
Asn 


AGA 
Arg 


ATC 
He 
265 


GTT 
val 


AAT 
Asn 


ATT 
He 


GAT 
Asp 


AGA 
Arg 
270 


TGT 
Cys 


TAT 
Tyr 


1233 


Qtat 
UJ Tyr 


AAA 
Lys 


AAA 
Lys 
275 


ACT 
Thr 


CGG 
Arg 


GCG 
Ala 


TTG 
Leu 


GCA 
Ala 
280 


CAT 
His 


GCT 
Ala 


TTG 
Leu 


TAT 
Tyr 


CGT 
Arg 
285 


ACA 
Thr 


GGT 
Gly 


GAA 

Glu 


1281 


w 

J3 TAT 
SJ Tyr 


AAA 
Lys 

290 


GTG 
Val 


CCA 
Pro 


GAT 
Asp 


GAA 
Glu 


AAT 
Asn 

295 


GGT 
Gly 


GTT 
val 


TTA 
Leu 


GTT 
Val 


TCA 
Ser 
300 


GGA 
Gly 


GGT 
Gly 


CTG 
Leu 


GAT 
Asp 


1329 


AAA 
Lys 
305 


CTT 
Leu 


GAG 
Glu 


GGG 
Gly 


ATG 
Met 


TTT 
Phe 
310 


GGT 
Gly 


TTT 
Phe 


ATT 
He 


GAT 
Asp 


CAA 
Gin 
315 


GTT 
Val 


GAT 
Asp 


AAG 
Lys 


TTT 
Phe 


AAC 
Asn 
320 


1377 


AAT 
Asn 


ATA 
He 


AAG 
Lys 


AAA 
Lys 


AAA 
Lys 
325 


CTG 
Leu 


AAC 
Asn 


AAG 
Lys 


CAA 
Gin 


CCT 
Pro 
330 


GAT 
Asp 


AGA 
Arg 


TAT 
Tyr 


GTA 
Val 


TTG 
Leu 
335 


ACT 
Thr 


1425 


AAT 
Asn 


GCG 
Ala 


ACT 
Thr 


TTG 
Leu 
340 


CAT 
His 


GGT 
Gly 


TTT 
Phe 


AAA 
Lys 


TTA 
Leu 
345 


AAG 
Lys 


TTG 
Leu 


AAT 
Asn 


GCG 
Ala 


CGA 
Arg 
350 


GAA 
Glu 


AAA 
Lys 


1473 


GCA 
Ala 


TAT 
Tyr 


AGT 
Ser 


AAA 
Lys 


TTT 
Phe 


ATT 
He 


TAC 
Tyr 


TAT 
Tyr 


AAA 
Lys 


TTT 
Phe 


TTT 
Phe 


CAT 
His 


GGC 
Gly 


AAC 
Asn 


ACC 
Thr 


TGT 
Cys 


1521 



% H 



CCT 
Pro 


ACG 
Thr 
370 


ATA 
He 


ATT 
He 


ACA 
Thr 


GAA 
Glu 


GGG 
Gly 
375 


AAG 
Lys 


ACT 
Thr 


GAT 
Asp 


CGG 
Arg 


ATA 
He 
380 


TAT 
Tyr 


TTG 
Leu 


AAG 
Lys 


GCT 
Ala 


1569 


GCT 
Ala 
385 


TTG 
Leu 


CAT 
His 


TCT 
Ser 


TTG 
Leu 


GAG 
Glu 
390 


ACA 
Thr 


TCA 
Ser 


TAT 
Tyr 


CCT 
Pro 


GAG 
Glu 
395 


TTG 
Leu 


TTT 
Phe 


AGA 
Arg 


GAA 
Glu 


AAA 
Lys 
400 


1617 


ACA 
Thr 


GAT 
Asp 


AGT 
Ser 


AAA 
Lys 


AAG 
Lys 
405 


AAA 
Lys 


GAA 
Glu 


ATA 
He 


AAT 
Asn 


CTT 
Leu 
410 


AAT 
Asn 


ATA 

He 


TTT 
Phe 


AAA 
Lys 


TCT 
Ser 
415 


AAT 
Asn 


1665 


\jJ-\JA 

Glu 


Lys 


Thr 


AAA 

Lys 
420 


TAT 
Tyr 


TTT 
Phe 


TTA 
Leu 


GAT 
Asp 


CTT 
Leu 
425 


TCT 
Ser 


GGG GGA 
Gly Gly 


ACT 
Thr 


GCA 
Ala 
430 


GAT 
Asp 


CTG 
Leu 


1713 


AAA 
Lys 


AAA 
Lys 


TTT 
Phe 
435 


GTA 
Val 


GAG 
Glu 


CGT 
Arg 


TAT 
Tyr 


AAA 
Lys 
440 


AAT 
Asn 


AAT 
Asn 


TAT 
Tyr 


GCT 
Ala 


TCT 
Ser 
445 


TAT 
Tyr 


TAT 
Tyr 


GGT 
Gly 


1761 



□ TCT GTT CCA AAA CAG CCA GTG ATT ATG GTT CTT GAT AAT GAT ACA GGT 1809 
gj Ser Val Pro Lys Gin Pro Val lie Met Val Leu Asp Asn Asp Thr Gly 
450 455 460 



% CCA AGC GAT TTA CTT AAT TTT CTG CGC AAT AAA GTT AAA AGC TGC CCA 1857 
— Pro Ser Asp Leu Leu Asn Phe Leu Arg Asn Lys Val Lys Ser Cys Pro 
H 465 470 475 480 

Us? 

*"* GAC GAT GTA ACT GAA ATG AGA AAG ATG AAA TAT ATT CAT GTT TTC TAT 1905 

L Asp Asp Val Thr Glu Met Arg Lys Met Lys Tyr lie His Val Phe Tyr 
O 485 490 495 

UJ 

O AAT TTA TAT ATA GTT CTC ACA CCA TTG AGT CCT TCC GGC GAA CAA ACT 1953 

W Asn Leu Tyr He Val Leu Thr Pro Leu Ser Pro Ser Gly Glu Gin Thr 

yl 500 505 510 

TCA ATG GAG GAT CTT TTC CCT AAA GAT ATT TTA GAT ATC AAG ATT GAT 2001 
Ser Met Glu Asp Leu Phe Pro Lys Asp He Leu Asp He Lys He Asp 
515 520 525 

GGT AAG AAA TTC AAC AAA AAT AAT GAT GGA GAC TCA AAA ACG GAA TAT 2049 
Gly Lys Lys Phe Asn Lys Asn Asn Asp Gly Asp Ser Lys Thr Glu Tyr 
530 535 540 

GGG AAG CAT ATT TTT TCC ATG AGG GTT GTT AGA GAT AAA AAG CGG AAA 2097 
Gly Lys His He Phe Ser Met Arg Val Val Arg Asp Lys Lys Arg Lys 
545 550 555 560 

ATA GAT TTT AAG GCA TTT TGT TGT ATT TTT GAT GCT ATA AAA GAT ATA 214 5 

He Asp Phe Lys Ala Phe Cys Cys He Phe Asp Ala He Lys Asp He 

565 570 575 

AAG GAA CAT TAT AAA TTA ATG TTA AAT AGC TAATGAACAG CCCTAACGTT 2195 
Lys Glu His Tyr Lys Leu Met Leu Asn Ser 



% H 



580 585 

ATGAACGCTA AGGCTGATTT TTCGTTAAAA TTTATATGGT TTGAATTGTA ATATATTATC 2255 

TTCAAGCCAT TTATTTAATT CCTGCATCCT TTTCTGTAAG GGTATTAATT CGTTCCTCAC 2315 

AAACACTAAA CTCGCTTTTT CCACATCCCC AAACCCCCCT AACATTATTC GGCATAATCC 2375 

CCATCATTTG CGGTGGCACA CGATGCGCTG CCATCATGTC ATCGCGGC 2423 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 amino acids 

(B) TYPE: amino acid 
.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



) (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

V Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro 
15 10 15 



W Leu Thr Glu Glu Lys He Lys Ala Leu Val Glu He Cys Thr Glu Met 
M* 20 25 30 

O Glu Lys Glu Gly Lys He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn 
U 35 40 45 



w 



Thr Pro Val Phe Ala He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys 

50 55 60 

Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin Asp Phe Trp Glu 
65 70 75 80 

Val Gin Leu Gly He Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser 

85 90 95 

Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp 
100 105 HO 

Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro Ser He Asn Asn 
115 120 125 

Glu Thr Pro Gly He Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp 
130 135 140 

Lys Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr Lys He Leu Glu 
145 150 155 160 



% 




Pro Phe Lys Lys 



Gin Asn Pro Asp 
165 



lie val lie Tyr 
170 



Gin Tyr Met Asp 
175 



Asp Leu Tyr Val Gly Ser Asp Leu 
180 



Glu lie Gly Gin 
185 



His Arg Thr Lys 
190 



He Glu Glu 
195 



Leu Arg Gin His Leu 

200 



Leu Arg Trp Gly 



Leu Thr Thr Pro 
205 



Asp Lys Lys 
210 



His Gin Lys Glu Pro 
215 



Pro Phe Leu Trp 
220 



Met Gly Tyr Glu 



Leu His Pro Asp 
225 



Lys Trp Thr Val 
230 



Gin Pro He Val 
235 



Leu Pro Glu Lys 
240 



Asp Ser Trp Thr 



Val Asn Asp He 
245 



Gin Lys Leu Val 
250 



Gly Lys Leu Asn 
255 



Trp Ala Ser Gin He Tyr Pro Gly 
260 



He Lys Val Arg 
265 



Gin Leu Cys Lys 
270 



Leu Leu Arg Gly 
275 



Thr Lys Ala Leu 
280 



Thr Glu Val He 



Pro Leu Thr Glu 
285 



5 - s 



Glu Ala Glu Leu 
290 

Val His Gly Val 
305 



Glu Leu Ala Glu 
295 

Tyr Tyr Asp Pro 
310 



Asn Arg Glu He 
300 

Ser Lys Asp Leu 
315 



Leu Lys Glu Pro 



He Ala Glu He 
320 



Gin Lys Gin Gly 



Gin Gly Gin Trp 
325 



Thr Tyr Gin He 
330 



Tyr Gin Glu Pro 
335 



Phe Lys Asn Leu Lys Thr Gly Lys 
340 



Thr Asn Asp 
355 



Val Lys Gin Leu Thr 

360 



Tyr Ala Arg Met 
345 

Glu Ala Val Gin 



Arg Gly Ala His 
350 

Lys lie Thr Thr 
365 



Glu Ser lie 
370 



Val lie Trp Gly Lys 
375 



Thr Pro Lys Phe 
380 



Lys Leu Pro lie 



Gin Lys Glu 
385 



Thr Trp Glu Thr Trp 
390 



Trp Thr Glu Tyr 
395 



Trp Gin Ala Thr 
400 



Trp lie Pro Glu 



Trp Glu Phe Val 
405 



Asn Thr Pro Pro 
410 



Leu Val Lys Leu 
415 



Trp Tyr Gin Leu Glu Lys Glu Pro 
420 



lie Val Gly Ala 
425 



Glu Thr Phe Tyr 
430 



Val Asp Gly 
435 



Ala Ala Asn Arg Glu 

440 



Thr Lys Leu Gly 



Lys Ala Gly Tyr 
445 



Val Thr Asn Lys Gly Arg Gin Lys Val Val Pro Leu Thr Asn Thr Thr 
450 455 460 

Asn Gin Lvs Thr Glu Leu Gin Ala lie Tyr Leu Ala Leu Gin Asp Ser 
465 - 470 475 480 

Gly Leu Glu Val Asn lie Val Thr Asp Ser Gin Tyr Ala Leu Gin lie 

485 490 495 

lie Gin Ala Gin Pro Asp Lys Ser Glu Ser Glu Leu Val Asn Gin lie 
500 505 510 

lie Glu Gin Leu lie Lys Lys Glu Lys Val Tyr Leu Ala Trp Val Pro 
515 520 525 

Ala His Lys Gly lie Gly Gly Asn Glu Gin Val Asp Lys Leu Val Ser 
530 535 540 

Ala Gly 
545 

INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DEI 

Arg Pro Trp Ala 
1 

Pro Phe Lys Pro 
20 

Ala Leu Glu Ala 

35 

Pro Val Phe Pro 

50 

Asp Leu Arg Ala 
65 

Pro Gly Pro Pro 



Gin Thr lie Asp 
100 



;CRIPTION: SEQ Ij 

Arg Thr Pro Pro 

5 

Glu Arg Leu Gin 



Gly His He Glu 
40 

Val Lys Lys Ala 
55 

Thr Asn Ser Leu 
70 

Asp Leu Ser Ser 
85 

Leu Arg Asp Ala 



► NO: 28: 

Lys Ala Pro Arg 
10 

Ala Leu Gin His 
25 

Pro Tyr Thr Gly 



Asn Gly Thr Trp 
60 

Thr He Asp Leu 
75 

Leu Pro Thr Thr 
90 

Phe Phe Gin He 
105 



Asn Gin Pro Val 
15 

Leu Val Arg Lys 
30 

Pro Gly Asn Asn 
45 

Arg Phe He His 



Ser Ser Ser Ser 
80 

Leu Ala His Leu 
95 

Pro Leu Pro Lys 
110 



i-A 
D 



e=w=t 



Gin Phe Gin Pro Tyr Phe Ala Phe Thr Val Pro Gin Gin Cys Asn Tyr 
115 120 125 

Gly Pro Gly Thr Arg Tyr Ala Trp Lys Val Leu Pro Gin Gly Phe Lys 
130 135 140 

Asn Ser Pro Thr Leu Phe Glu Met Gin Leu Ala His lie Leu Gin Pro 
145 150 155 160 

lie Arg Gin Ala Phe Pro Gin Cys Thr lie Leu Gin Tyr Met Asp Asp 

165 170 175 

lie Leu Leu Ala Ser Pro Ser His Glu Asp Leu Leu Leu Leu Ser Glu 
180 185 190 

Ala Thr Met Ala Ser Leu lie Ser His Gly Leu Pro Val Ser Glu Asn 
195 200 205 

Lys Thr Gin Gin Thr Pro Gly Thr He Lys Phe Leu Gly Gin He He 

210 215 220 

Ser Pro Asn His Leu Thr Tyr Asp Ala Val Pro Thr Val Pro lie Arg 
225 230 235 240 

Ser Arg Trp Ala Leu Pro Glu Leu Gin Ala Leu Leu Gly Glu He Gin 

245 250 255 

Trp Val Ser Lys Gly Thr Pro Thr Leu Arg Gin Pro Leu His Ser Leu 
260 265 270 

Tyr Cys Ala Leu Gin Arg His Thr Asp Pro Arg Asp Gin He Tyr Leu 



W 275 280 285 



Asn Pro Ser Gin Val Gin Ser Leu val Gin Leu Arg Gin Ala Leu Ser 
290 295 300 

Gin Asn Cys Arg Ser Arg Leu Val Gin Thr Leu Pro Leu Leu Gly Ala 
305 310 315 320 

He Met Leu Thr Leu Thr Gly Thr Thr Thr Val Val Phe Gin Ser Lys 

325 330 335 

Glu Gin Trp Pro Leu Val Trp Leu His Ala Pro Leu Pro His Thr Ser 
340 345 350 

Gin Cys Pro Trp Gly Gin Leu Leu Ala Ser Ala Val Leu Leu Leu Asp 
355 360 365 

Lys Tyr Thr Leu Gin Ser Tyr Gly Leu Leu Cys Gin Thr He His His 
370 375 380 

Asn He Ser Thr Gin Thr Phe Asn Gin Phe He Gin Thr Ser Asp His 
385 390 395 400 



Pro Ser Val Pro lie Leu Leu His His Ser His Arg Phe Lys Asn Leu 

405 410 415 

Gly Ala Gin Thr Gly Glu Leu Trp Asn Thr Phe Leu Lys Thr Ala Ala 
420 425 430 

Pro Leu Ala Pro Val Lys Ala Leu Met Pro Val Phe Thr Leu Ser Pro 
435 440 445 

Val lie lie Asn Thr Ala Pro Cys Leu Phe Ser Asp Gly Ser Thr Ser 
450 455 460 

Arg Ala Ala Tyr lie Leu Trp Asp Lys Gin lie Leu Ser Gin Arg Ser 
465 470 475 480 

Phe Pro Leu Pro Pro Pro His Lys Ser Ala Gin Arg Ala Glu Leu Leu 

485 490 495 

Gly Leu Leu His Gly Leu Ser Ser Ala Arg Ser Trp Arg Cys Leu Asn 
500 505 510 

He Phe Leu Asp Ser Lys Tyr Leu Tyr His Tyr Leu Arg Thr Leu Ala 
515 520 525 

Leu Gly Thr Phe Gin Gly Arg Ser Ser Gin Ala Pro Phe Gin Ala Leu 
g£ 530 535 540 

W Leu Pro Arg Leu Leu Ser Arg Lys Val Val Tyr Leu His His Val Arg 

H 545 550 555 560 

O Ser His Thr Asn Leu Pro Asp Pro He Ser Arg Leu Asn Ala Leu Thr 

Ly 565 570 575 

'ess? 

bj Asp Ala 

(2) INFORMATION FOR SEQ ID NO: 29: 



jy 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Asn Val Leu Tyr Arg lie Gly Ser Asp Asn Gin Tyr Thr Gin Phe Thr 
15 10 15 

He Pro Lys Lys Gly Lys Gly Val Arg Thr He Ser Ala Pro Thr Asp 
20 25 30 



Arg Leu Lys Asp lie Gin Arg Arg lie Cys Asp Leu Leu Ser Asp Cys 
35 40 45 

Arg Asp Glu He Phe Ala He Arg Lys He Ser Asn Asn Tyr Ser Phe 
50 55 60 

Gly Phe Glu Arg Gly Lys Ser He He Leu Asn Ala Tyr Lys His Arg 
65 70 75 80 

Gly Lys Gin He He Leu Asn He Asp Leu Lys Asp Phe Phe Glu Ser 

85 90 95 

Phe Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu Ser Asn Gin Asp Phe 
100 105 HO 

Leu Leu Asn Pro Val Val Ala Thr Thr Leu Ala Lys Ala Ala Cys Tyr 
115 120 125 

Asn Gly Thr Leu Pro Gin Gly Ser Pro Cys Ser Pro He He Ser Asn 

130 135 140 

P ,. 

CO \[| Leu He Cys Asn He Met Asp Met Arg Leu Ala Lys Leu Ala Lys Lys 
145 150 155 160 

Tyr Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp He Thr He Ser Thr 
Q 165 170 175 

2 Asn Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gin Pro Glu Gly 

180 185 190 



val Val Leu Gly Lys Val Leu Val Lys Glu He Glu Asn Ser Gly Phe 
195 200 205 



W Glu He Asn Asp Ser Lys Thr Arg Leu Thr Tyr Lys Thr Ser Arg Gin 

*0 210 215 220 

M 

Glu Val Thr Gly Leu Thr Val Asn Arg He val Asn He Asp Arg Cys 
225 230 235 240 

Tyr Tyr Lys Lys Thr Arg Ala Leu Ala His Ala Leu Tyr Arg Thr Gly 

245 250 255 

Glu Tyr Lys Val Pro Asp Glu Asn Gly Val Leu val Ser Gly Gly Leu 
260 265 270 

Asp Lys Leu Glu Gly Met Phe Gly Phe He Asp Gin Val Asp Lys Phe 
275 280 285 

Asn Asn He Lys Lys Lys Leu Asn Lys Gin Pro Asp Arg Tyr Val Leu 
290 295 300 

Thr Asn Ala Thr Leu His Gly Phe Lys Leu Lys Leu Asn Ala Arg Glu 
305 310 315 320 



Lys Ala Tyr Ser Lys Phe lie Tyr Tyr Lys Phe Phe His Gly Asn Thr 

325 330 335 

Cys Pro Thr lie lie Thr Glu Gly Lys Thr Asp Arg lie Tyr Leu Lys 
340 345 350 

Ala Ala Leu His Ser Leu Glu Thr Ser Tyr Pro Glu Leu Phe Arg Glu 
355 360 365 

Lys Thr Asp Ser Lys Lys Lys Glu lie Asn Leu Asn lie Phe Lys Ser 
370 375 380 

Asn Glu Lys Thr Lys Tyr Phe Leu Asp Leu Ser Gly Gly Thr Ala Asp 
385 390 395 400 

Leu Lys Lys Phe Val Glu Arg Tyr Lys Asn Asn Tyr Ala Ser Tyr Tyr 

405 410 415 

Gly Ser Val Pro Lys Gin Pro Val He Met Val Leu Asp Asn Asp Thr 
420 425 430 

Gly Pro Ser Asp Leu Leu Asn Phe Leu Arg Asn Lys Val Lys Ser Cys 
435 440 445 

|" Pro Asp Asp Val Thr Glu Met Arg Lys Met Lys Tyr He His Val Phe 
450 455 460 

Tyr Asn Leu Tyr He Val Leu Thr Pro Leu Ser Pro Ser Gly Glu Gin 
465 470 475 480 



s - i 



Thr Ser Met Glu Asp Leu Phe Pro Lys Asp He Leu Asp lie Lys He 

485 490 495 



^ Asp Gly Lys Lys Phe Asn Lys Asn Asn Asp Gly Asp Ser Lys Thr Glu 

^ 500 505 510 

Si 

Tyr Gly Lys His He Phe Ser Met Arg Val Val Arg Asp Lys Lys Arg 
515 520 525 

Lvs He Asp Phe Lys Ala Phe Cys Cys He Phe Asp Ala lie Lys Asp 
530 535 540 

lie Lys Glu His Tyr Lys Leu Met Leu Asn Ser 
545 550 555 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Arg Trp Phe Ser Phe His Arg Glu val Asp Thr Gly Thr His Tyr Gin 
15 10 1*5 

Thr Trp Glu lie Pro Lys Arg Asp Gly Gly Lys Arg Thr Leu Thr Ala 
20 25 30 

Pro Lys Arg Glu Leu Lys Ala Val Gin Arg Trp Val Leu Ala Asn Val 
35 40 45 

Val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala Gly 
50 55 60 

Arg Ser lie Leu Thr Asn Ala Leu Ala His Gin Gly Ala Asp Val Val 
65 70 75 80 

Val Lys Val Asp Met Lys Asp Phe Phe Pro Ser Val Thr- Trp Pro Arg 

85 90 95 

rMil val l y s g1 y Leu Leu Ar 9 l y s G1 y G1 Y Leu Pro Glu Asn Leu Ala Thr 

100 105 110 

Leu Leu Ala Leu Leu Ser Thr Glu Ala Pro Arg Glu Val Val Arg Phe 
115 120 125 

Arg Gly Glu Thr Leu Tyr Val Ala Lys Gly Pro Arg Ala Leu Pro Gin 
130 135 140 



M Gly Ala Pro Thr Ser Pro Ala Leu Thr Asn Ala Leu Cys Leu Arg Leu 

W 145 150 155 160 



W Asp Lys Arg Leu Ser Ala Leu Ser Lys Arg Leu Gly Phe Thr Tyr Thr 

03 165 170 175 



M 



Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp Arg Arg Ala Lys Lys Ser 
180 185 190 

Arg Gin Lys Glu Leu Pro Leu Ala Asp Ala Pro Val Ala Leu Leu Leu 
195 200 205 

Ala Arg Val Lys Gly Val Leu Glu Ala Glu Gly Phe Thr Leu His Pro 
210 215 220 

Asp Lys Thr Arg Val Gin Arg Lys Gly Ser Arg Gin Arg Val Thr Gly 
225 230 235 240 

Leu Val Val 



(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS; 



(A) LENGTH; 241 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



fir 



1 val Lys val Asp Leu Lys Asp Phe Phe Pro Ser val Thr Trp Arg Arg 

85 90 95 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Arq Trp Phe Ala Phe His Arg Glu Val Asp Thr Ala Thr His Tyr val 
15 10 15 

Ser Trp Thr He Pro Lys Arg Asp Gly Ser Lys Arg Thr He Thr Ser 

20 25 30 

Pro Lys Pro Glu Leu Lys Ala Ala Gin Arg Trp Val Leu Ser Asn Val 
35 40 45 

Val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala Gly 
50 55 60 

Arq Ser He Leu Thr Asn Ala Leu Ala His Gin Gly Ala Asp Val val 
65 70 75 80 



val Lys Gly Leu Leu Arg Lys Gly Gly Leu Arg Glu Gly Thr Ser Thr 
100 105 HO 

Leu Leu Ser Leu Leu Ser Thr Glu Ala Pro Arg Glu Ala Val Gin Phe 
115 120 125 

Pro Arg Glu Leu Leu His Val Ala Lys Gly Pro Arg Ala Leu Pro Gin 
130 135 140 

Gly Ala Pro Thr Ser Pro Gly He Thr Asn Ala Leu Cys Leu Lys Leu 
145 150 155 160 

Asp Lys Arg Leu Ser Ala Leu Ala Lys Arg Leu Gly Phe Thr Tyr Thr 

165 170 175 

Arq Tyr Ala Asp Asp Leu Thr Phe Ser Trp Thr Lys Ala Lys Gin Pro 
y 180 185 190 

Lys Pro Arg Arg Thr Gin Arg Pro Pro Val Ala Val Leu Leu Ser Arg 
195 200 205 

Val Gin Glu val Val Glu Ala Glu Gly Phe Arg Val His Pro Asp Lys 
210 215 220 

Thr Arg Val Ala Arg Lys Gly Thr Arg Gin Arg Val Thr Gly Leu Val 
225 230 235 240 



Val 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



I 



hi 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ara His Tyr Ser He His Arg Pro Arg Glu Arg Val Arg His Tyr Val 

l 5 10 15 . 

Thr Phe Ala val Pro Lys Arg Ser Gly Gly Val Arg Leu Leu His Ala 
20 25 30 

Pro Lys Arg Arg Leu Lys Ala Leu Gin Arg Arg Met Leu Ala Leu Leu 
35 40 45 

val Ser Lys Leu Pro Val Ser Pro Gin Ala His Gly Phe Val Pro Gly 
50 55 SO 

Arq Ser He Lys Thr Gly Ala Ala Pro His Val Gly Arg Arg Val Val 
— 70 75 ■ 80 



65 



Leu Lys Leu Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Phe Ala Arg 

85 90 95 

Val Arg Gly Leu Leu Lys Ala Leu Gly Tyr Gly Tyr Pro Val Ala Ala 
100 105 I 10 

Thr Leu Ala Val Leu Met Thr Glu Ser Glu Arg Gin Pro Val Glu Leu 
115 120 125 

Glu Gly He Leu Phe His Val Pro Val Gly Pro Arg Val Cys Val Gin 
130 135 140 



Gly Ala Pro Thr Ser Pro Ala Leu Cys Asn Ala Val Leu Leu Arg Leu 
145 150 155 



160 



Asp Arg Arg Leu Ala Gly Leu Ala Arg Arg Tyr Gly Tyr Thr Tyr Thr 

165 170 I 75 

Arq Tyr Ala Asp Asp Leu Thr Phe Ser Gly Asp Asp Val Thr Ala Leu 
3 180 185 190 

Glu Arg Val Arg Ala Leu Ala Ala Arg Tyr Val Gin Glu Glu Gly Phe 
195 200 205 




Glu Val Asn Arg Glu Lys Thr Arg Val Gin Arg Arg Gly Gly Ala Gin 
210 215 220 

Arg val Thr Gly Val Thr val 
225 230 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



J3S 

E-tUS 



w 

Q 

w 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Phe Leu Thr Asn Val Leu Tyr Arg He Gly Ser Asp Asn Gin Tyr Thr 
15 10 15 

Gin Phe Thr He Pro Lys Lys Gly Lys Gly Val Arg Thr He Ser Ala 
20 25 30 

Pro Thr Asp Arg Leu Lys Asp He Gin Arg Arg He Cys Asp Leu Leu 
35 40 45 

Ser Asp Cys Arg Asp Glu He Phe Ala He Arg Lys He Ser Asn Asn 

50 55 60 

Tyr Ser Phe Gly Phe Glu Arg Gly Lys Ser He He Leu Asn Ala Tyr. 
65 • 70 75 80 

Lys His Arg Gly Lys Gin He He Leu Asn He Asp Leu Lys Asp Phe 

85 90 95 

Phe Glu Ser Phe Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu Ser Asn 
100 105 HO 

Gin Asp Phe Leu Leu Asn Pro Val Val Ala Thr Thr Leu Ala Lys Ala 
115 120 125 

Ala Cys Tyr Asn Gly Thr Leu Pro Gin Gly Ser Pro Cys Ser Pro He 
130 135 140 

lie Ser Asn Leu He Cys Asn lie Met Asp Met Arg Leu Ala Lys Leu 
145 150 155 160 

Ala Lys Lys Tyr Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp He Thr 

165 170 175 

He Ser Thr Asn Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gin 
180 185 190 



Pro Glu Gly Val Val Leu Gly Lys Val Leu Val Lys Glu He Glu Asn 
195 200 205 

Ser Gly Phe Glu He Asn Asp Ser Lys Thr Arg Leu Thr Tyr Lys Thr 
210 215 220 

Ser Arg Gin Glu Val Thr Gly Leu Thr Val 
225 230 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



Lii 



: a 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Val Glu Thr Leu Arg Leu Leu He Tyr Thr Ala Asp Phe Arg Tyr Arg 
15 10 15 

He Tvr Thr Val Glu Lys Lys Gly Pro Glu Lys Arg Met Arg Thr He 
20 25 30 

Tyr Gin Pro Ser Arg Glu Leu Lys Ala Leu Gin Gly Trp Val Leu Arg 

35 40 45 

Asn He Leu Asp Lys Leu Ser Ser Ser Pro Phe Ser He Gly Phe Glu 
50 55 60 

Lvs His Gin Ser He Leu Asn Asn Ala Thr Pro His He Gly Ala Asn 
J - 75 80 



65 



70 



Phe He Leu Asn He Asp Leu Glu Asp Phe Phe Pro Ser Leu Thr Ala 

85 90 95 

Asn Lys Val Phe Gly Val Phe His Ser Leu Gly Tyr Asn Arg Leu He 
100 105 HO 

Ser Ser Val Leu Thr Lys He Cys Cys Tyr Lys Asn Leu Leu Pro Gin 
115 120 125 

Gly Ala Pro Ser Ser Pro Lys Leu Ala Asn Leu He Cys Ser Lys Leu 
130 135 140 

Asp Tyr Arg He Gin Gly Tyr Ala Gly Ser Arg Gly Leu He Tyr Thr 
145 150 155 160 

Arc Tvr Ala Asp Asp Leu Thr Leu Ser Ala Gin Ser Met Lys Lys val 

IfiR 170 175 



Val Lys Ala Arg Asp Phe Leu Phe Ser lie lie Pro Ser Glu Gly Leu 
180 185 190 

Val lie Asn Ser Lys Lys Thr Cys lie Ser Gly Pro Arg Ser Gin Arg 
195 200 205 

Lys Val Thr Gly Leu Val lie 
210 215 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Thr Lys Gly Phe Ala Ser Glu Val Met Arg Ser Pro Glu Pro Pro Lys 
15 10 15 

Lys Trp Asp lie Ala Lys Lys Lys Gly Gly Met Arg Thr lie Tyr His 
20 25 30 



^ Pro Ser Ser Lys Val Lys Leu lie Gin Tyr Trp Leu Met Asn Asn Val 

L 35 40 45 

Q 

W Phe Ser Lys Leu Pro Met His Asn Ala Ala Tyr Ala Phe Val Lys Asn 

□ 50 55 60 



3*3 



Arg Ser lie Lys Ser Asn Ala Leu Leu His Ala Glu Ser Lys Asn Lys 
65 70 75 80 

Tyr Tyr Val Lys lie Asp Leu Lys Asp Phe Phe Pro Ser lie Lys Phe 

85 90 95 

Thr Asp Phe Glu Tyr Ala Phe Thr Arg Tyr Arg Asp Arg lie Glu Phe 
100 105 110 

Thr Thr Glu Tyr Asp Leu Glu Leu Leu Gin Leu lie Lys Thr lie Cys 
115 120 125 

Phe lie Ser Asp Ser Thr Leu Pro lie Gly Phe Pro Thr Ser Pro Leu 
130 135 140 

lie Ala Asn Phe Val Ala Arg Glu Leu Asp Glu Lys Leu Thr Gin Lys 
145 150 155 160 

Leu Asn Ala lie Asp Lys Leu Asn Ala Thr Tyr Thr Arg Tyr Ala Asp 

165 170 175 



Asp lie He Val Ser Thr Asn Met Lys Gly Ala Ser Lys Leu He Leu 
180 185 190 

Asp Cys Phe Lys Arg Thr Met Lys Glu He Gly Pro Asp Phe Lys He 
195 200 205 

Asn He Lys Lys Phe Lys He Cys Ser Ala Ser Gly Gly Ser He Val 

210 215 220 

Val Thr Gly Leu Lys Val 
225 230 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

| (ii) MOLECULE TYPE: protein 



M 



□ 

8.3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

He Gin Arg Leu His Ala Leu Ser Asn His Ala Gly Arg His Tyr Arg 
1 5 10 15 

Arg He He Leu Ser Lys Arg His Gly Gly Gin Arg Leu Val Leu Ala 
20 25 30 

Pro Asp Tyr Leu Leu Lys Thr Val Gin Arg Asn He Leu Lys Asn Val 
35 40 45 

Leu Ser Gin Phe Pro Leu Ser Pro Phe Ala Thr Ala Tyr Arq Pro Glv 
50 55 60 



Cys Pro He Val Ser Asn Ala Gin Pro His Cys Gin Gin Pro Gin He 
65 70 75 80 

Leu Lys Leu Asp He Glu Asn Phe Phe Asp Ser He Ser Trp Leu Gin 

85 90 95 

Val Trp Arg Val Phe Arg Gin Ala Gin Leu Pro Arg Asn Val Val Thr 
100 105 110 

Met Leu Thr Trp He Cys Cys Tyr Asn Asp Ala Leu Pro Gin Gly Ala 
115 120 125 

Pro Thr Ser Pro Ala He Ser Asn Leu Val Met Arg Arg Phe Asp Glu 
130 135 140 



Arg He Gly Glu Trp Cys Gin Ala Arg Gly He Thr Tyr Thr Arg Tyr 
i45 150 155 160 




55 60 65 

CGC CGC TAC ACC CCG GGC CGG AAG AAG TGG ATG GAG GCC GCC GAG GCC 533 
Arg Arg Tyr Thr Pro Gly Arg Lys Lys Trp Met Glu Ala Ala Glu Ala 
70 75 80 85 

CGG CGG CTG TTC TCC GCC ACG CTG CGC ACG CGG AAC CGG AAC CTG AGG 581 
Arq Arq Leu Phe Ser Ala Thr Leu Arg Thr Arg Asn Arg Asn Leu Arg 

90 95 100 

GAC TTG CTG CCC GAC GAG GCA CAG CTG GCG CGC TAC GGC CTG CCG GTC 629 
Asp Leu Leu Pro Asp Glu Ala Gin Leu Ala Arg Tyr Gly Leu Pro Val 
105 HO 115 

TGG CGC ACG GAA GAG GAC GTG GCA GCG GCC CTG GGC GTC TCG GTG GGC 677 
Trp Arg Thr Glu Glu Asp Val Ala Ala Ala Leu Gly Val Ser Val Gly 
120 125 130 

GTG CTC CGC CAC TAC AGC ATC CAC CGC CCG CGC GAG CGG GTG CGG CAC 725 
Val Leu Arg His Tyr Ser lie His Arg Pro Arg Glu Arg Val Arg His 
135 140 145 

^TAC GTG ACC TTC GCC GTG CCC AAG CGC TCC GGA GGC GTC CGG CTG CTG 773 

fTyr Val Thr Phe Ala Val Pro Lys Arg Ser Gly Gly Val Arg Leu Leu 
^150 155 160 165 

yy 

OCAT GCG CCC AAG CGG CGC CTG AAG GCC CTG CAA CGC CGG ATG CTG GCG 821 

Wr±s Ala Pro Lys Arg Arg Leu Lys Ala Leu Gin Arg Arg Met Leu Ala 
M= 170 175 180 

a 

OCTC CTG GTG TCG AAG CTC CCC GTG AGT CCA CAG GCC CAT GGC TTC GTG 86 9 

LULeu Leu Val Ser Lys Leu Pro Val Ser Pro Gin Ala His Gly Phe Val 
p 185 190 195 

|.| 

ScCC GGC CGC TCC ATC AAG ACG GGC GCC GCG CCG CAC GTG GGC CGG CGG 917 
vi Pro Gly Arg Ser He Lys Thr Gly Ala Ala Pro His Val Gly Arg Arg 
200 205 210 

GTG GTC CTG AAG CTG GAC CTG AAG GAC TTC TTC CCC TCC GTC ACC TTC 965 
Val Val Leu Lys Leu Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Phe 
215 220 225 

GCG CGG GTG CGA GGG CTG CTC ATC GCC CTG GGC TAC GGC TAT CCC GTG 1013 
Ala Arg Val Arg Gly Leu Leu He Ala Leu Gly Tyr Gly Tyr Pro Val 
230 235 240 245 

GCG GCC ACG CTC GCG GTG CTG ATG ACG GAG TCC GAG CGC CAG CCC GTG 1061 
Ala Ala Thr Leu Ala Val Leu Met Thr Glu Ser Glu Arg Gin Pro Val 

250 255 260 

GAG CTG GAG GGC ATC CTC TTC CAC GTT CCC GTG GGC CCA CGC GTC TGC 1109 
Glu Leu Glu Gly He Leu Phe His Val Pro Val Gly Pro Arg Val Cys 
265 270 275 



m 



Cys Asp Asp Met Thr Phe Ser Gly His Phe Asn Ala Arg Gin Val Lys 

165 170 175 

Asn Lys Val Cys Gly Leu Leu Ala Glu Leu Gly Leu Ser Leu Asn Lys 
180 185 190 

Arg Lys Gly Cys Leu He Ala Ala Cys Lys Arg Gin Gin Val Thr Gly 
195 200 205 

He Val Val 
210 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 279.. 1559 




hj (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

* CTCCGAGCCC GCCTCCGAGG ACGCGCTCGC GGCCCGGGCG GCGGGGGCGG ACGCGCGGCG 60 
^ GCGGCCCACG GAGACGCTTG ACCCGGGAGA CGACGAATGA CGATAACGGC AGGTGCTCTC 12 0 

LiLS 

R GGGAGAGGCC AGGGCTCGCA GATGAGCCAT GAGTACCGCG GTGTTTCGCC GCGGGGGTGT 180 

^ TCTGTCCCCA TCTCTTCGCC AGGGTCCCAG CGTACGCAAC GCAGGGAGCC CCGGGTCCAA 24 0 

CGCCTCGCAG GTCGTCCCCT GGCCTCTTCC GGAGCACC ATG AGC TGG TTC GAC 293 

Met Ser Trp Phe Asp 
1 5 

ACC ACC CTC TCC CGG CTC AAG GGG TTG TTC AGC CGT CCC GTG ACA CGA 341 
Thr Thr Leu Ser Arg Leu Lys Gly Leu Phe Ser Arg Pro Val Thr Arg 

10 15 20 

AGC ACC ACC GGG CTG GAC GTG CCG CTG GAT GCC CAC GGA CGT CCC CAG 389 
Ser Thr Thr Gly Leu Asp Val Pro Leu Asp Ala His Gly Arg Pro Gin 

25 30 ' 35 

GAC GTC GTG ACG GAG ACG GTC TCC ACG TCG GGC CCC CTG AAG CCA GGG 43 7 

Asp Val Val Thr Glu Thr Val Ser Thr Ser Gly Pro Leu Lys Pro Gly 
40 45 50 

CAC CTG CGA CAG GTC CGC CGG GAT GCG CGG CTG CTC CCC AAG GGC GTC 485 
His Leu Arg Gin Val Arg Arg Asp Ala Arg Leu Leu Pro Lys Gly Val 



GTG 


CAG 


GGC 


GCC 


ccc 


ACG 


AGC 


CCC 


GCC 


CTG 


TGC 


AAC 


GCG 


GTG 


CTG 


CTG 


1157 


val 


Gin 


Gly 


Ala 


Pro 


Thr 


Ser 


Pro 


Ala 


Leu 


Cys 


Asn 


Ala 


val 


Leu 


Leu 


















O D C 

0 0 










0 q n 










CGA 


CTG 


GAC 


CGG 


CGG 


CTG 


GCG 


GGA 


CTG 


GCG 


CGT 


CGG 


TAC 


GGC 


TAC 


ACG 


1205 


Arg 


Leu 


Asp 


Arg 


Arg 


Leu 


Ala 


Gly 


Leu 


Ala 


Arg 


Arg 


Tyr 


Gly 


Tyr 


Thr 






2 9 5 










0 a n 

3.0 0 










one 












TAC 


ACG 


CGC 


TAC 


GCG 


GAT 


GAC 


CTC 


ACC 


TTC 


TCC 


GGC 


GAC 


GAC 


GTC 


ACG 


1253 


Tyr 


Thr 


Arg 


Tyr 


Ala 


Asp 


Asp 


Leu 


Thr 


Phe 


Ser 


Gly 


Asp 


Asp 


val 


Thr 




O ± V 










0 JL O 










0 n 










0 A o 




GCG 


CTG 


GAG 


CGA 


GTC 


CGC 


GCG 


CTG 


GCC 


GCG 


CGG 


TAC 


GTG 


CAG 


GAG 


GAA 


1301 


Ala 


Leu 


Glu 


Arg 


Val 


Arg 


Ala 


Leu 


Ala 


Ala 


Arg 


Tyr 


Val 


Gin 


Glu 


Glu 
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"DOC 
O J 3 










340 






PPP 


1 1L 


PAP 


Lii L- 






pap 


A AP 


A PP 

ALL 


PPP 

LuL 


bl Lj 


PAP 


ppp 


nan, 

L-V7V7 


GGC 


GGT 




pi v 


rr Iltr 


Pi n 


Vdi 




Arg 


Pi n 


T \rc* 

ijys 


1 11L 


Arg 


V a. J. 


PI n 


A vrt 

Arg 




Gly Gly 




A \l 
















ISO 

J J u 










355 








M GCC 


CAG 


CGC 


GTC 


ACT 


GGC 


GTC 


ACC 


GTG 


AAT 


ACG 


ACG 


CTG 


GGC 


TTG 


TCA 


1397 


/WAla 


Gin 


Arg 


Val 


Thr 


Gly 


val 


Thr 


Val 


Asn 


Thr 


Thr 


Leu 


Gly 


Leu 


Ser 




^ 63 




J W V 




















370 










a 


































ffl CGC 


GAG 


GAG 


CGG 


CCG 


CGG 


CTC 


CGG 


GCG 


ATG 


CTG 


CAC 


CAG 


GAG 


GCG 


CGG 


1445 


G Arg 


Glu 


Glu 


Arg 


Pro 


Arg 


Leu 


Arg 


Ala 


Met 


Leu 


His 


Gin 


Glu 


Ala 


Arg 




S M = 

!__! 


O / D 
































Ttcg 


GAG 


GAC 


GTC 


GAG 


GCA 


CAC 


CGC 


GCG 


CAC 


CTC 


GAC 


GGC 


CTC 


CTG 


GCC 


1493 


n Ser 


Glu 


Asp 


Val 


Glu 


Ala 


His 


Arg 


Ala 


His 


Leu 


Asp 


Gly 


Leu 


Leu 


Ala 




id 390 










395 










400 










405 




rl TAC 


GTG 


AAG 


ATG 


CTC 


AAC 


CCG 


GAG 


CAG 


GCG 


GAG 


CGG 


CTC 


GCT 


CGC 


CGG 


1541 




Val 


Lys 


Met 


Leu 


Asn 


Pro 


Glu 


Gin 


Ala 


Glu 


Arg 


Leu 


Ala 


Arg 


Arg 





^ 410 415 420 



CGC AAG CCG CGC GGG ACG TGAGCGAGGG CTCAGCTCCG GATGGGCCAG 1589 
Arg Lys Pro Arg Gly Thr 
425 

GGCCTGTCAC GCGTCCCGGC CTCCCAGTTG TCATGGCGGC CGTCCCAGTA C 1640 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 



(B) LOCATION: 763 2202 



[ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CCCACTTCCG GCGCTCGGGC TGCGCGAGGG CCCGTGCGAG CACATGATGG CGCTGCGGCT 60 

CGT CCAGGTC CGGCACCGCG CCGAGCAGGA AGCACTGCGT CAGACCCCCG CGGGCCGCCA 120 

GCTCATCCGC GCGGAGACGC GCTCCTACGT GCGGCGCGAG CCCTCCGGCC AGGAGCAGGT 18 0 

GTACCGCGTC TCATTGGATG GGAAAGTGGT GGCGGTGGAG TGGGGCCCCC GCCAGGGGGA 240 

GTCCCGCCGG CAGAAGCTCT GGTTCGACAC GGACGCCGAG GCGCGCACCG CCTACTTCAC 3 00 

GCGCCTGGAG TCCTTGGCCG CGGAGGGATA TATCGATGCG GCTGCTTCAA TGATGTAGAA 360 

CACGCAAGCC ACGGGGCCGC GGGCGCGCGG CGGAAAGGCA GGTGCGACGG AACGACAGAC 420 



lACTCGTGCGA GCGACCGAGA GAGGTCCCAA GCCATCAGCC TCAGCGCCTC GAGCGCGAGA 4 80 

JJ" GCGGCGTTGC GCCGCTCTGG TTGAATTGCA GGACACTCTC CGCAAGGTAG CCTGTTCTTG 54 0 

O GCTCTCTTCC CTCCGGTGAG TACCTCTCCG GCCGGGGAGC TGAACCAACG ACGCAACCGC 6 00 

fn 

O CGTTTCCCCG GCCGGAGAGG TACTCACCGG AGGGGAGAGC CGGTGAGGCT ACCGTGCCCC 660 

hj 

jl AGGTGAGAAG GTGGTGC CTT CGGGCCTCCC TCGACCGCTC GCGCTCCGTC GCCCTGCCCT 720 

Q GCCTCGCCCC CCCCACCTTG CTCACCGGCG CCAGGAGCCG TC ATG ACC GCC AAG 774 

s s i Met Thr Ala Lys 

- 

% CTG GAG TCA CAC GTC CCC GCC GCG CCC CCC GTC TCC GCC GAG GCG CCC 822 
M Leu Giu Ser His Val Pro Ala Ala Pro Pro Val Ser Ala Glu Ala Pro 
^5 10 15 20 

GCC CCC ACC CGT CCC GAT GCC GCG AAG CAG GAG GCC CGC CGC GCC CAC 870 
Ala Pro Thr Arg Pro Asp Ala Ala Lys Gin Glu Ala Arg Arg Ala His 

25 30 35 

CAC GAG GCG CTG CGC CTG CGG TGG AAG GCC ATC GAA GAG GCG GGC GGC 918 
His Glu Ala Leu Arg Leu Arg Trp Lys Ala lie Glu Glu Ala Gly Gly 
40 45 50 

ACG GAC GCC TGG GTG CGG CAG CAG CTG GTG GCC AAG GGC GTC GCG GCG 966 
Thr Asp Ala Trp Val Arg Gin Gin Leu Val Ala Lys Gly Val Ala Ala 
55 * 60 65 

GAA GAG GTG GAC TTC GAG TCG CTC AGC GAC AAG CAG AAG GCG GCC TGG 1014 
Glu Glu val Asp Phe Glu Ser Leu Ser Asp Lys Gin Lys Ala Ala Trp 
70 75 80 

AAG GAG AAG AAG AAG GCC GAG GCC ACC GAG CGG CGC GCG CAG AAG CGC 1062 





Lys Glu Lys Lys Lys Ala Glu Ala Thr Glu Arg Arg Ala Gin Lys Arg 
85 90 95 100 

CTG GCG TGG GAG GCC TGG AAG GCC ACG CAC ATC CAC CAC CTG GGC GTG 1110 
Leu Ala Trp Glu Ala Trp Lys Ala Thr His lie His His Leu Gly Val 

105 110 115 

GGG GTG CAC TGG GAC GAG GCC GGA GGG CCG GAC AAG TTC GAC GTG GCC 1158 
Gly Val His Trp Asp Glu Ala Gly Gly Pro Asp Lys Phe Asp Val Ala 
120 125 130 

GGG CGC GAG GAG CGG GCC AAG GCC AAC GGC TTG CCG GAG GGG TTG GAC 1206 
Gly Arg Glu Glu Arg Ala Lys Ala Asn Gly Leu Pro Glu Gly Leu Asp 
135 140 145 

TCG GTC GAG GCG CTG GCC AAA GCG CTG GGC ATC TCC GTG TCG CGC CTG 1254 
Ser Val Glu Ala Leu Ala Lys Ala Leu Gly lie Ser Val Ser Arg Leu 
150 155 160 

CGC TGG TTC TCC TTC CAC CGC GAG GTG GAC ACG GGC ACG CAC TAC CAG 13 02 

Arg Trp Phe Ser Phe His Arg Glu Val Asp Thr Gly Thr His Tyr Gin 
165 170 175 180 

□ ACG TGG GAG ATT CCG AAG CGG GAC GGC GGC AAG CGG ACG CTC ACC GCG 1350 

©Thr Trp Glu lie Pro Lys Arg Asp Gly Gly Lys Arg Thr Leu Thr Ala 
P 185 190 195 

Ui 

j^CCG AAG CGG GAG CTC AAG GCC GTG CAG CGC TGG GTG CTC GCG AAC GTG 13 98 

g Pro Lys Arg Glu Leu Lys Ala Val Gin Arg Trp Val Leu Ala Asn Val 

f* 200 205 210 

tar 

S GTG GAG CGG CTG CCG GTG CAC GGG GCC GCG CAC GGC TTC GTG GCG GGG 1446 
"val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala Gly 
^ 215 220 225 

^ CGC TCC ATC CTC ACC AAC GCG CTG GCC CAC CAG GGC GCG GAC GTG GTG 14 94 

Arg Ser lie Leu Thr Asn Ala Leu Ala His Gin Gly Ala Asp Val Val 
230 235 240 

GTG AAG GTG GAC ATG AAG GAC TTC TTC CCT TCC GTG ACG TGG CCC CGG 1542 
Val Lys Val Asp Met Lys Asp Phe Phe Pro Ser Val Thr Trp Pro Arg 
245 250 255 260 

GTC AAG GGA CTG CTG CGC AAG GGA GGA CTC CCG GAG AAC CTG GCG ACG 1590 
Val Lys Gly Leu Leu Arg Lys Gly Gly Leu Pro Glu Asn Leu Ala Thr 

265 270 275 

CTC CTG GCG CTG CTC TCC ACC GAG GCC CCG CGC GAG GTG GTG CGG TTC 1638 
Leu Leu Ala Leu Leu Ser Thr Glu Ala Pro Arg Glu Val Val Arg Phe 
280 285 290 

CGG GGA GAG ACG CTG TAC GTG GCC AAG GGC CCT CGC GCG CTG CCC CAG 1686 
Arg Gly Glu Thr Leu Tyr Val Ala Lys Gly Pro Arg Ala Leu Pro Gin 
295 300 305 



v 



GGG GCC CCC ACC TCT CCG GCG CTG ACG AAC GCG CTG TGC CTG CGG CTG 1734 
Gly Ala Pro Thr Ser Pro Ala Leu Thr Asn Ala Leu Cys Leu Arg Leu 
310 315 320 

GAC AAG CGG CTC TCG GCG CTG TCG AAG CGG CTG GGC TTC ACG TAC ACG 1782 
Asp Lys Arg Leu Ser Ala Leu Ser Lys Arg Leu Gly Phe Thr Tyr Thr 
325 330 335 340 

CGC TAT GCG GAT GAC CTG ACG TTC TCC TGG CGG CGG GCG AAG AAG TCC 183 0 

Arq Tyr Ala Asp Asp Leu Thr Phe Ser Trp Arg Arg Ala Lys Lys Ser 

345 350 355 

CGG CAG AAG GAA CTC CCC CTG GCG GAT GCG CCG GTG GCG CTG CTC CTG 1878 
Ara Gin Lys Glu Leu Pro Leu Ala Asp Ala Pro Val Ala Leu Leu Leu 
360 365 370 

GCG CGG GTG AAG GGT GTG CTG GAG GCC GAG GGT TTC ACG CTG CAC CCG 1926 
Ala Arg Val Lys Gly Val Leu Glu Ala Glu Gly Phe Thr Leu His Pro 
375 380 385 

□ GAC AAG ACG CGG GTG CAG CGC AAG GGC AGC CGG CAG CGG GTG ACG GGG 1974 
63 Asp Lys Thr Arg Val Gin Arg Lys Gly Ser Arg Gin Arg Val Thr Gly 
' 390 395 400 

5 CTC GTG GTG AAC GAG GCC CCC GAG GGC GTT CCG GGT GCC CGG GTG CCC 2022 

Leu Val Val Asn Glu Ala Pro Glu Gly Val Pro Gly Ala Arg Val Pro 
=3a 0 5 410 415 420 

** CGC GAT GTG GTG CGG CGG CTG CGC GCG GCG ATC CAC AAC CGG GAG CAG 2070 
U Arg Asp val Val Arg Arg Leu Arg Ala Ala He His Asn Arg Glu Gin 
ft 425 430 435 

H GGC AAG CCC GGC CCC ACC GGG GAG ACG CTG GAG CAG CTC AAG GGG CTC 2118 
^ Gly Lys Pro Gly Pro Thr Gly Glu Thr Leu Glu Gin Leu Lys Gly Leu 

-43 440 445 450 

M 

GCG GCC TTC CTT CAC ATG ACG GAC GCG GAG AAG GGC CGC GCC TTC CTG 2166 
Ala Ala Phe Leu His Met Thr Asp Ala Glu Lys Gly Arg Ala Phe Leu 
455 460 465 

CGA CGG CTG GAG GCC CTC GAG AAG CGC CAG ACC GCC TGACCCTCAC 2212 
Arg Arg Leu Glu Ala Leu Glu Lys Arg Gin Thr Ala 
470 475 480 

TGGTCGTCCG GGGCATCGCA GCGGGCGCCG GGACGGACCG TCACCCCCCA GATCTCCATG 2272 

CCATGCTGGG GATTCTGGGC GGTGAAGAAG ACTTCCCAGC CGAGACGGAC GAAGCCCTGC 2332 

GGATCCGATG ACTCCTCGCC CGGGGCGATC TCCCGGAGGG GCACCGTTCC GACGTCCGTG 2392 

CCATTGCTCA CCCAGGGCTC CCGGCCCCAG CCTTGGGTGT CCGCCGAGAA GAAGAGCAGC 2452 

CCGGAGATGG CCGTCAGGTT CTCCGGCGAC GCATCCTCGG GGCCCGGCGC CAAATCCTTC 2512 



AGCAGCAGGG 


TGCCCTTGGC 


GGTGCCATCG 


CTGGACCACA 


GCTCCCGGCC 


GTGGAGGCTG 


2572 


TCACTCGCGG 


CGAAGTAGAG 


CATCCCATTC 


AGCGCCTTGA 


TGGCGCTGGG 


CGCCGAGCTG 


2632 


TCCGGACCCG 


GCCAGATGTC 


CTTCACCCGG 


ACCGTGCCAT 


GCGACGTGCC 


ATCGCTGACC 


2692 


CACAGCTCCT 


CGCCCTCGGG 


CTGGCCCCAG 


AACTCGGGCT 


CGCCTCCCCC 


GGCGCTGAAG 


2752 


AAGATCTTCC 


CCCCGAGCGC 


CGTGAGATCA 


TGCGGATAGA 


GGCCGGGGAA 


GAAGCGCAGC 


2812 


TGCTCGGAGA 


CGGTGCCTCT 


GGAGCACCAC 


AGGCTGGCCT 


CGCCTTCGTC 


ATTGTCGAGC 


2872 


a fir* a an a an a 




CGCCGCGGTG 


AACGCGGAGA 


GGAAGTTGTC 


CTCGGGGCCC 


2932 


GTGAAGACAG 


ACGTGGTGCT 


GGACAGCCCC 


AGGCTGCGCC 


AGATGAACAC 


CTCGTCATTG 


2992 


ACGTTGGCCA 


CGAAGAAGAG 


CGCATCGCCG 


ACCCGGGTGA 


GCCGGCGCGG 


GCTGGAGCTG 


3052 


CCGGGCAC 












3060 



fig (2) INFORMATION FOR SEQ ID NO: 39: 

3 (i) SEQUENCE CHARACTERISTICS: 

Ut (A) LENGTH: 2788 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
q (D) TOPOLOGY: linear 



U 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 . . 103 



~ ( ix) FEATURE : 

M (A) NAME /KEY: CDS 

(B) LOCATION: 707.. 1654 

( ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1644.. 2591 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



T TTC GAG AAG CGC CAT ACC AAA CAG GGG ATA CAG ACC AAC CTG ACG 46 
Phe Glu Lys Arg His Thr Lys Gin Gly He Gin Thr Asn Leu Thr 
15 10 15 

CTG AAA GAG GAA AGC TAC GGC GAC TGG CTG CCG AAG TGC GAC GAC CCC 94 
Leu Lys Glu Glu Ser Tyr Gly Asp Trp Leu Pro Lys Cys Asp Asp Pro 

20 25 30 

GCA GCA ACA TAACCTCACT CAGACCGGCA ACAGCCGGTC TTTTCCTTTC 143 



% 




Ala Ala Thr 




TGGCCATTGC 


CACAAGGTGA 


ACAATCCACT 


GTTCACCCTT 


CACCGTTTAT 


TCACCCTTTA 


203 


TCACTATGAA 


ATTATTAATA 


AAAAACCAGA 


GGTGAACAGT 


GTGAACAGTA 


AAACCTGAAA 


263 


AAACTTTTTA 


TCACCCCGCG 


CATCGCCCGA 


CTGGACAGAT 


CCAGAACGAG 


CAAAAATCAC 


3 23 


AAAGGTGACG 


AGTCGACTGT 


T C ACT CTT C A 


CCAACTCATC 


ACCACCTAAC 


CACATGATAT 


"3 Q 1 


AAAATGATAA 


ATAATCGAGG 


TGAACAGTTA 


AATGCAAAAA 


AACTTTTTCT 


CAGCTCTTGG 


A A "X 




aTTZVATTPAP 

r\ x J. -rtJT. X x v^-riv- 


ATCAATAGCT 


TTCCTCTTGA 


ATCCTCTTGA 


GGTTTATGAG 


503 


AGCGTAACAG 


AGCCAAACCT 


AGCATTTTAT 


GGGTTAATAG 


CCCATCGCGC 


ATGAGTCATG 


563 


GTTTCGCCTA 


GTATTTTAGC 


TATGCCCGTC 


GTTCAGTTCG 


CTGAGCGGCG 


GCTGGGGGCC 


623 


ACCGATCAGC 


GAACTGATCG 


ACGTGCTCAA 


GTAGGTTTGG 


CTCTTTTAGT 


CCTCTACCAT 


683 


5 CAAGGTGCAT 


AAGGATATTC 


TCG ATG CTG ACT CAG CTA AAA AAA AAT GGT 
Met Leu Thr Gin Leu Lys Lys Asn Gly 


733 



ACT GAG GTA TCT AGA GCA ACC GCG TTA TTT TCA TCA TTC GTT GAA AAG 781 
H Thr Glu Val Ser Arg Ala Thr Ala Leu Phe Ser Ser Phe Val Glu Lys 
m 1Q 15 20 25 

L AAC AAA GTA AAA TGT CCT GGT AAT GTA AAA AAA TTC GTC TTT CTG TGT 82 9 

W Asn Lys Val Lys Cys Pro Gly Asn val Lys Lys Phe val Phe Leu Cys 
UJ 30 35 40 

0 GGT GCT AAC AAA AAC AAT GGA GAA CCA TCA GCA AGA CGA TTG GAA TTA 877 
y3 Gly Ala Asn Lys Asn Asn Gly Glu Pro Ser Ala Arg Arg Leu Glu Leu 
M 45 50 55 

ATA AAT TTT TCT GAA AGG TAT TTG AAT AAC TGT CAC TTT TTT CTT GCT 925 
lie Asn Phe Ser Glu Arg Tyr Leu Asn Asn Cys His Phe Phe Leu Ala 
60 65 70 

GAA CTA GTT TTC AAA GAA TTA AGC ACC GAT GAA GAA TCA TTA TCT GAT 973 
Glu Leu Val Phe Lys Glu Leu Ser Thr Asp Glu Glu Ser Leu Ser Asp 
75 80 85 

AAT TTA TTA GAT ATC GAA GCT GAC TTA TCT AAA TTA GCT GAT CAT ATT 1021 
Asn Leu Leu Asp He Glu Ala Asp Leu Ser Lys Leu Ala Asp His He 
90 95 100 105 

ATC ATT GT^.TTA GAA AGT TAT TCA TCT TTC ACG GAA CTT GGT GCA TTC 1069 
He He Vc Leu Glu Ser Tyr Ser Ser Phe Thr Glu Leu Gly Ala Phe 

110 115 120 

GCA TAC AGC AAG CAA TTA CGC AAG AAA TTA ATA ATA GTT AAC AAT ACA HI 7 




Ala Tyr Ser Lys Gin Leu Arg Lys Lys Leu He He Val Asn Asn Thr 
125 130 135 

AAA TTT ATA AAT GAG AAA TCA TTT ATA AAT ATG GGA CCA ATA AAG GCT 1165 
Lys Phe He Asn Glu Lys Ser Phe He Asn Met Gly Pro He Lys Ala 
140 145 150 

ATT ACT CAG CAA TCA CAA CAA TCT GGT CAT TTC TTA CAT TAT AAA ATG 1213 
He Thr Gin Gin Ser Gin Gin Ser Gly His Phe Leu His Tyr Lys Met 
155 160 165 

ACA GAA GGT ATT GAA AGT ATA GAG CGC TCT GAT GGG ATT GGC GAA ATA 1261 
Thr Glu Gly He Glu Ser He Glu Arg Ser Asp Gly He Gly Glu He 
170 175 180 185 

TTC GAC CCC CTA TAT GAT ATT CTT TCT AAG AAC GAC AGA GCA ATT TCA 1309 
Phe Asp Pro Leu Tyr Asp He Leu Ser Lys Asn Asp Arg Ala He Ser 

190 195 200 

IAGA ACT TTA AAA AAA GAA GAG TTA GAT CCT TCC AGT AAC TTC AAT AAA 1357 
f*i Arg Thr Leu Lys Lys Glu Glu Leu Asp Pro Ser Ser Asn Phe Asn Lys 
1 205 210 215 

2! GAC TCA GTA CGA TTT ATT CAT GAC GTA ATT TTT GTA TGT GGT CCT TTG 1405 
Sasp Ser Val Arg Phe He His Asp Val He Phe Val Cys Gly Pro Leu 
S 220 225 230 

W CAA CTT AAT GAA CTC ATC GAA ATA ATC ACA AAA ATA TTT GGC ACA GAA 1453 
'"Gin Leu Asn Glu Leu He Glu He He Thr Lys He Phe Gly Thr Glu 
235 240 245 

O 

y AGC CAT TAC AAA AAA AAT CTT CTA AAG CAC CTT GGT ATT CTA ATA GCT 15 01 

S Ser His Tyr Lys Lys Asn Leu Leu Lys His Leu Gly He Leu lie Ala 
W 250 255 260 265 

3 ATT AGA ATA ATA TCA TGC ACA AAT GGG ATT TAT TAT TCT TTG TAT AAA 154 9 

" He Arq He He Ser Cys Thr Asn Gly He Tyr Tyr Ser Leu Tyr Lys 

270 275 280 

GAA TAT TAT TTT AAA TAT GAC TTT GAC ATT GAC AAC ATA TCA TCA ATG 15 97 

Glu Tyr Tyr Phe Lys Tyr Asp Phe Asp He Asp Asn He Ser Ser Met 
285 290 295 

TTT AAA GTT TTT TTC CTC AAG AAC AAG CCA GAA AGG ATG AGG GTA TAT 1645 
Phe Lys Val Phe Phe Leu Lys Asn Lys Pro Glu Arg Met Arg Val Tyr 
300 305 310 

GAG AAT ATA TAGCCTAATT GATTCTCAGA CATTGATGAC TAAGGGATTT 1694 
Glu Asn He 
315 

GCTTCTGAAG TAATGCGATC ACCTGAGCCG CCAAAAAAAT GGGATATAGC TAAGAAAAAA 1754 
GGAGGTATGA GAACAATTTA TCACCCGTCA TCAAAAGTTA AATTAATTCA ATATTGGTTA 1814 




ATGAATAATG TTTTTTCGAA GCTCCCAATG CATAATGCTG CATATGCATT TGTTAAAAAC 1874 

CGATCAATAA AAAGCAATGC TTTATTACAT GCCGAATCAA AGAATAAGTA TTATGTGAAA 1934 

ATAGATCTCA AAGATTTTTT CCCTTCAATA AAATTTACTG ATTTTGAGTA CGCATTCACT 1994 

CGTTATCGAG ATCGCATTGA ATTTACTACA GAATATGATA AGGAGTTACT ACAACTTATA 2054 

AAAACGATCT GCTTTATATC AGATAGCACT CTCCCTATCG GGTTTCCTAC ATCTCCATTA 2114 

ATTGCAAACT TTGTGGCAAG AGAACTTGAT GAAAAACTGA CGCAAAAACT AAATGCAATT 2174 

GATAAACTTA ATGCCACTTA TACACGATAT GCTGATGATA TTATTGTCTC TACAAATATG 2234 

AAAGGGGCTA GCAAATTAAT TCTGGATTGT TTTAAAAGAA CAATGAAAGA GATTGGTCCA 2294 

GACTTTAAAA TTAACATTAA AAAATTTAAG ATTTGTAGTG CTTCGGGAGG AAGTATAGTA 2354 

GTTACCGGAT TGAAAGTTTG CCACGATTTT CATATTACAT TACATAGATC AATGAAAGAT 2414 

pAAAATAAGAT TGCATCTTTC TCTTTTATCA AAGGGCATAT TAAAAGATGA AGATCATAAT 2474 

-AAACTTTCTG GTTATATTGC TTATGCAAAA GATATAGACC CTCATTTTTA TACAAAACTG 2 534 

KaACAGAAAAT ATTTTCAAGA AATAAAATGG ATTCAGAATC TCCACAACAA AGTTGAATAA 2594 

H ACTTTATATT TTGGATGCAC CCCAATAACT TCATTGATTA AATTGGGAAC AATATAGGCT 2654 

TTTCAGGATG ACCTACACTC TAGAGAATGT GTATACAAAA GTGTATAAGT TATTTTCAAA 2714 

S CCTATATAAA ATACAGCAAA ATCAATGCAT TGGCGGCATT TTACCACTCC TGTGATCTTC' 2774 
^ CGCCAAAATG CCTC 



I: 



Si (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Met Arg He Tyr Ser Leu He Asp Ser Gin Thr Leu Met Thr Lys Gly 
1 " 5 10 15 

Phe Ala Ser Glu Val Met Arg Ser Pro Glu Pro Pro Lys Lys Trp Asp 
20 25 30 

He Ala Lys Lys Lys Gly Gly Met Arg Thr He Tyr His Pro Ser Ser 
35 40 45 



2788 




Lys Val Lys Leu lie Gin Tyr Trp Leu Met Asn Asn Val Phe Ser Lys 
50 55 60 

Leu Pro Met His Asn Ala Ala Tyr Ala Phe Val Lys Asn Arg Ser lie 
65 70 75 80 

Lys Ser Asn Ala Leu Leu His Ala Glu Ser Lys Asn Lys Tyr Tyr Val 

85 90 • 95 

Lys lie Asp Leu Lys Asp Phe Phe Pro Ser ile Lys Phe Thr Asp Phe 
100 105 110 

Glu Tyr Ala Phe Thr Arg Tyr Arg Asp Arg Ile Glu Phe Thr Thr Glu 
115 120 125 

Tyr Asp Lys Glu Leu Leu Gin Leu lie Lys Thr Ile Cys Phe Ile Ser 
130 135 140 

Asp Ser Thr Leu Pro Ile Gly Phe Pro Thr Ser Pro Leu Ile Ala Asn 
145 150 155 160 

0 Phe Val Ala Arg Glu Leu Asp Glu Lys Leu Thr Gin Lys Leu Asn Ala 
m 165 170 175 

gg Ile Asp Lys Leu Asn Ala Thr Tyr Thr Arg Tyr Ala Asp Asp Ile Ile 
P 180 185 190 

3-3 

2 Val Ser Thr Asn Met Lys Gly Ala Ser Lys Leu Ile Leu Asp Cys Phe 
195 200 205 

~ Lys Arg Thr Met Lys Glu lie Gly Pro Asp Phe Lys Ile Asn Ile Lys 
*£{ 210 " 215 220 

Lys Phe Lys Ile Cys Ser Ala Ser Gly Gly Ser Ile Val Val Thr Gly 
225 230 235 240 

Leu Lys Val Cys His Asp Phe His lie Thr Leu His Arg Ser Met Lys 

245 250 255 

Asp Lys Ile Arg Leu His Leu Ser Leu Leu Ser Lys Gly Ile Leu Lys 
260 265 270 

Asp Glu Asp His Asn Lys Leu Ser Gly Tyr Ile Ala Tyr Ala Lys Asp 
275 280 285 

lie Asp Pro His Phe Tyr Thr Lys Leu Asn Arg Lys Tyr Phe Gin Glu 
290 295 300 

Ile Lys Trp lie Gin Asn Leu His Asn Lys Val Glu 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 548.. 1507 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

TGGCATCTAT TAAGAAGGTT AGGAAAGAAA ATAAAGTATC AAAAGATATT GGAAATATAT 60 

TATACGCAGA GCGTTTCTAT TGCCTTGTAT CTATTTACTG GATAGTGTCA ACTACCGCAC 120 

ACTGTGTGAA CTAGCTTTTA AAGCGATAAA GCAAGATGAT GTTTTATCTA AAATTATTGT 180 

TAGATCCGTT GTTTCTCGTC TAATAAATGA ACGAAAAATA CTTCAAATGA CTGATGGTTA 240 

TCAGGTCACT GCTTTGGGGG CTAGCTATGT TAGGAGCGTC TTTGATAGAA AGACACTTGA 300 

CCGATTGCGG CTTGAGATTA TGAATTTTGA AAACCGTAGA AAATCAACAT TTAACTATGA 360 

TAAGATTCCG TATGCGCACC CTTAGCGAGA GGTTTATCAT TAAGGTCAAC CTCTGGATGT 420 

TGTTTCGGCA TCCTGCATTG AATCTGAGTT ACTGTCTGTT TTCCTTGTTG GAACGGAGAG 480 

CATCGCCTGA TGCTCTCCGA GCCAACCAGG AAACCCGTTT TTTCTGACGT AAGGGTGCGC 540 

AACTTTC ATG AAA TCC GCT GAA TAT TTG AAC ACT TTT AGA TTG AGA AAT 589 
Met Lys Ser Ala Glu Tyr Leu Asn Thr Phe Arg Leu Arg Asn 
15 10 

CTC GGC CTA CCT GTC ATG AAC AAT TTG CAT GAC ATG TCT AAG GCG ACT 63 7 

Leu Gly Leu Pro Val Met Asn Asn Leu His Asp Met Ser Lys Ala Thr 
15 20 25 30 

CGC ATA TCT GTT GAA ACA CTT CGG TTG TTA ATC TAT ACA GCT GAT TTT 685 
Arg He Ser Val Glu Thr Leu Arg Leu Leu He Tyr Thr Ala Asp Phe 

35 40 45 

CGC TAT AGG ATC TAC ACT GTA GAA AAG AAA GGC CCA GAG AAG AGA ATG 733 
Arg Tyr Arg He Tyr Thr Val Glu Lys Lys Gly Pro Glu Lys Arg Met 
50 55 60 

AGA ACC ATT TAC CAA CCT TCT CGA GAA CTT AAA GCC TTA CAA GGA TGG 781 
Arg Thr He Tvr Gin Pro Ser Arg Glu Leu Lys Ala Leu Gin Gly Trp 
65 * 70 75 

GTT CTA CGT AAC ATT TTA GAT AAA CTG TCG TCA TCT CCT TTT TCT ATT 829 
Val Leu Arg Asn He Leu Asp Lys Leu Ser Ser Ser Pro Phe Ser He 
80 85 90 



GGA TTT GAA AAG CAC CAA TCT ATT TTG AAT AAT GCT ACC CCG CAT ATT 877 
Gly Phe Glu Lys His Gin Ser He Leu Asn Asn Ala Thr Pro His He 
95 100 105 HO 

GGG GCA AAC TTT ATA CTG AAT ATT GAT TTG GAG GAT TTT TTC CCA AGT 925 
Gly Ala Asn Phe He Leu Asn He Asp Leu Glu Asp Phe Phe Pro Ser 

115 120 125 

TTA ACT GCT AAC AAA GTT TTT GGA GTG TTC CAT TCT CTT GGT TAT AAT 973 
Leu Thr Ala Asn Lys Val Phe Gly Val Phe His Ser Leu Gly Tyr Asn 
130 135 140 

CGA CTA ATA TCT TCA GTT TTG ACA AAA ATA TGT TGT TAT AAA AAT CTG 1021 
Arg Leu He Ser Ser Val Leu Thr Lys He Cys Cys Tyr Lys Asn Leu 

145 150 155 

CTA CCA CAA GGT GCT CCA TCA TCA CCT AAA TTA GCT AAT CTA ATA TGT 1069 
Leu Pro Gin Gly Ala Pro Ser Ser Pro Lys Leu Ala Asn Leu He Cys 
160 165 170 

iHTCT AAA CTT GAT TAT CGT ATT CAG GGT TAT GCA GGT AGT CGG GGC TTG 1117 

Ser Lys Leu Aso Tyr Arg He Gin Gly Tyr Ala Gly Ser Arg Gly Leu 
% 175 180 185 190 

J ATA TAT ACG AGA TAT GCC GAT GAT CTC ACC TTA TCT GCA CAG TCT ATG 1165 
I He Tyr Thr Arg Tyr Ala Asp Asp Leu Thr Leu Ser Ala Gin Ser Met 
U 195 200 205 

N= AAA AAG GTT GTT AAA GCA CGT GAT TTT TTA TTT TCT ATA ATC CCA AGT 1213 
» Lys Lys Val Val Lys Ala Arg Asp Phe Leu Phe Ser He He Pro Ser 
210 215 220 



O GAA GGA TTG GTT ATT AAC TCA AAA AAA ACT TGT ATT AGT GGG CCT CGT 1261 
ui Glu Gly Leu Val He Asn Ser Lys Lys Thr Cys He Ser Gly Pro Arg 

3 225 230 235 

' AGT CAG AGG AAA GTT ACA GGT TTA GTT ATT TCA CAA GAG AAA GTT GGG 1309 
Ser Gin Arg Lys val Thr Gly Leu Val He Ser Gin Glu Lys Val Gly 
240 245 250 

ATA GGT AGA GAA AAA TAT AAA GAA ATT AGA GCA AAG ATA CAT CAT ATA 13 57 

He Gly Arg Glu Lys Tyr Lys Glu He Arg Ala Lys He His His He 
255 260 265 270 

TTT TGC GGT AAG TCT TCT GAG ATA GAA CAC GTT AGG GGA TGG TTG TCA 14 05 

Phe Cys Gly Lys Ser Ser Glu He Glu His Val Arg Gly Trp Leu Ser 

275 280 285 

TTT ATT TTA AGT GTG GAT TCA AAA AGC CAT AGG AGA TTA ATA ACT TAT 14 53 

Phe He Leu Ser Val Asp Ser Lys Ser His Arg Arg Leu He Thr Tyr 
290 295 300 

ATT AGC AAA TTA GAA AAA AAA TAT GGA AAG AAC CCT TTA AAT AAA GCG 1501 
He Ser Lys Leu Glu Lys Lys Tyr Gly Lys Asn Pro Leu Asn Lys Ala 



♦ % 




305 310 315 

AAG ACC TAATGGTCTT CGTTTTAAAA CTAAAGCTCA TAGGTTGAAA AATTGAGCAC 1557 
Lys Thr 
320 

TTCTTCGTCC AACCAGTTAT TTAGTTCCTG CAATCGTTTC TGCAG 1602 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 0 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 396.. 1352 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

® TCACCCTGAA AGACCTGATT GCTTACCTGG AAGAGAAGCC GGAAATGGCG GAACATCTGG 60 

W CGGCGGTTAA GGCCTATCGC GAAGAGTTCG GCGTTTAAAA ATATGCGCTG TGCAGGGTTT 120 

5 TTGCTGTGCG CAGCGTGATG CGCTTCAAGA TATCGTGTTA ATCTGCTTTC GCCAGCAGTG 180 

y GCAATAGCGT TTCCGGCCTT TTGTGCCGGG AGGGTCGGCG AGTCGCTGAC TTAACGCCAG 24 0 

jj TAGTATGTCC ATATACCCAA AGTCGCTTCA TTGT AC CTGA GTACGCTTCG CGTACGTCGC 300 

GCTGACGCGC TCAGTACAGT TACGCGCCTT CGGGATGGTT TAATGGTATT GCCGCTGTTG 360 

GCGCCTCTTT TGGCCGCCGT GATGTGGAGA GTGGA ATG GAT GCT ACC CGG ACA 413 

Met Asp Ala Thr Arg Thr 
1 5 

ACC CTT CTG GCG CTC GAT TTG TTC GGC TCG CCG GGC TGG AGC GCC GAT 461 
Thr Leu Leu Ala Leu Asp Leu Phe Gly Ser Pro Gly Trp Ser Ala Asp 
10 15 20 

AAA GAA ATA CAG CGA CTG CAT GCG CTC AGT AAT CAT GCC GGA CGC CAT 509 
Lys Glu lie Gin Arg Leu His Ala Leu Ser Asn His Ala Gly Arg His 
25 30 35 

TAC CGA CGC ATT ATT CTT TCT AAA CGC CAC GGT GGT CAG CGG CTG GTG 557 
Tyr Arg Arg He He Leu Ser Lys Arg His Gly Gly Gin Arg Leu Val 
40 45 50 

TTA GCC CCT GAT TAC TTG CTC AAA ACC GTA CAG CGC AAC ATT CTT AAG 60 5 




Leu Ala Pro Asp Tyr Leu Leu Lys Thr Val Gin Arg Asn lie Leu Lys 
55 60 65 70 

AAC GTC CTT TCA CAA TTT CCG CTT TCC CCT TTT GOT ACA GCC TAC CGA 653 

Asn Val Leu Ser Gin Phe Pro Leu Ser Pro Phe Ala Thr Ala Tyr Arg 

75 80 85 

CCA GGT TGC CCA ATC GTC AGC AAC GCG CAG CCA CAC TGC CAA CAG CCG 701 
Pro Gly Cys Pro lie Val Ser Asn Ala Gin Pro His Cys Gin Gin Pro 
90 95 100 

CAG ATC CTG AAA CTC GAT ATC GAA AAC TTT TTC GAT AGC ATT AGC TGG 749 
Gin lie Leu Lys Leu Asp lie Glu Asn Phe Phe Asp Ser lie Ser Trp 
105 110 115 

TTA CAG GTC TGG CGT GTG TTT CGC CAG GCC CAG TTG CCA CGT AAT GTG 797 
Leu Gin Val Trp Arg Val Phe Arg Gin Ala Gin Leu Pro Arg Asn Val 
120 125 130 

GTA ACC ATG CTG ACC TGG ATT TGT TGT TAT AAC GAC GCG TTA CCG CAG 845 
Val Thr Met Leu Thr Trp lie Cys Cys Tyr Asn Asp Ala Leu Pro Gin 
J 135 140 145 150 

gj GGG GCA CCA ACT TCG CCA GCC ATT TCC AAT CTT GTG ATG CGC CGT TTT 893 
g Gly Ala Pro Thr Ser Pro Ala lie Ser Asn Leu Val Met Arg Arg Phe 
« 155 160 165 

[jj GAT GAA CGC ATA GGG GAA TGG TGT CAG GCT CGG GGA ATT ACC TAC ACC 941 
^ Asp Glu Arg He Gly Glu Trp Cys Gin Ala Arg Gly He Thr Tyr Thr 
;L 17 0 175 180 

W CGC TAC TGC GAT GAC ATG ACC TTT TCA GGT CAC TTC AAT GCC CGC CAG 989 
O Arg Tyr Cys Asp Asp Met Thr Phe Ser Gly His Phe Asn Ala Arg Gin 
W 185 190 195 

SJ GTT AAA AAT AAA GTG TGC GGA TTG TTA GCG GAG CTG GGC CTG AGC CTC 103 7 

Val Lys Asn Lys Val Cys Gly Leu Leu Ala Glu Leu Gly Leu Ser Leu 
200 205 210 

AAT AAA CGC AAA GGC TGC CTG ATA GCT GCC TGT AAG CGC CAG CAA GTA 1085 
Asn Lys Arg Lys Gly Cys Leu He Ala Ala Cys Lys Arg Gin Gin Val 
215 220 225 230 

ACC GGG ATT GTT GTT AAT CAC AAG CCA CAG CTT GCC CGT GAA GCG CGC 1133 
Thr Gly He Val Val Asn His Lys Pro Gin Leu Ala Arg Glu Ala Arg 

235 240 245 

CGG GCG CTG CGT CAG GAG GTG CAT TTG TGC CAA AAA TAT GGC GTT ATT 1181 
Arg Ala Leu Arg Gin Glu Val His Leu Cys Gin Lys Tyr Gly Val lie 
250 255 260 

TCG CAT CTT AGT CAT CGT GGT GAA CTT GAT CCT TCT GGC GAT CTC CAC 1229 
Ser His Leu Ser His Arg Gly Glu Leu Asp Pro Ser Gly Asp Leu His 
265 270 275 



♦ 4 



GCA 


CAG 


GCA 


ACG 


GCG 


TAT 


CTT 


TAT 


GCT 


TTG 


CAG 


GGA AGA 


ATA 


AAC 


TGG 


Ala 


Gin 


Ala 


Thr 


Ala 


Tyr 


Leu 


Tyr 


Ala 


Leu 


Gin 


Gly Arg 


He 


Asn 


Trp 




280 










9 ft 










290 








TTA 


J. -L \J 


PAA 


ATP 


A AP 


PPT* 


nan 


HAT 


pan 


GCC 


TTT 


CAA CAG 


GCG 


AGA 


nap 


Leu 


Leu 


Gin 


He 


Asn 


Pro 


Glu 


Asp 


Glu 


Ala 


Phe 


Gin Gin 


Ala 


Arg 


Glu 


295 










300 










305 






310 


AGT 


GTA 


AAG 


CGA 


ATG 


CTG 


GTT 


GCA 


TGG 


TAAGAAAAGC GTCAGGCAGA 




Ser 


val 


Lys 


Arg 


Met 


Leu 


val 


Ala 


Trp 






















315 























CGTTTCTGCC TGACCGTTTA GGGGAGAATT ACTGCAACTG CGCGGCAATT AGCGGCCAGC 1432 
GGGCGTCAAA ATCATCCGTC GGGCGGTATT TAAACTCGCT GCGGACAAAA CGTGACAGCA 14 92 
TACCTTCACA GAAGGCCAGG ATCTGGCTTG CCAGCAGGGT TTCATCGG 154 0 



GO 



(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 



Uf Tyr Xaa Asp Asp 
O 1 4 

w 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44 

Ser Xaa Xaa Xaa 
l 4 



(2) INFORMATION FOR SEQ ID NO: 45: 



• 



& 



(i) SEQUENCE CHARACTERISTICS: 

(A) YengtH: 4 amino acids 

(B) tVpE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 



Caa Val Thr Gly 
1 4 




